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ABSTRACT 

This  paper  considers  two  aspects  of  the  word  problem 
for  groups:  first,  some  similarities  between  derivations 
from  the  relations  of  a  group  and  proofs  from  a  set  of 
axioms  in  logic,  and  second,  the  computational  difficulty 
of  word  problems  and  the  problem  of  constructing  groups 
with  solvable  word  problems  of  some  preassigned  degree  of 
difficulty . 

Given  a  presentation  of  a  group  in  terms  of  generators  and 
relations,  we  define  two  words  on  the  generators  to  be  equal  if 
and  only  if  one  can  be  transformed  into  the  other  in  a  finite 
number  of  steps  using  the  relations  of  the  group.  Defining  equal- 
ity in  a  group  as  derivability  from  a  set  of  axioms  suggests 
formulating  the  word  problem  for  a  group  as  the  derivability 
problem  for  a  formal  system.  The  system  we  choose  is  equational 
logic.  We  show  that  there  exist  some  striking  analogies  between 
results  from  logic  and  results  about  groups:  nontrivial  groups 
correspond  to  consistent  systems,  groups  with  solvable  word  prob- 
lem to  decidable  systems,  and  simple  groups  to  complete  systems. 
This  suggests  that  results  about  formal  systems  could  be  used  to 
obtain  results  about  decidability  in  groups. 

A  group  has  word  problem  in  level  £   of  the  Grzegorczyk 
hierarchy  if  the  running  time  of  the  algorithm  solving  the 
word  problem  is  in  £".   Groups  having  word  problem  solvable 
in  levels  s"  (n  ^  2)  of  the  Grzegorczyk  hierarchy  are  given. 
The  groups   are  constructed  using  a  standard  procedure  of 
constructing  a  group  given  the  presentation  of  a  semigroup. 
Semigroups  are  constructed  following  J.  Robinson's  method  of 
functional  equations:  the  semigroups  are  decidable  systems  of 
functional  equations  which  define  functions  in  £  .   This 
technique  for  constructing  semigroups  is  particularly  elegant, 
for  it  has  the  advantage  of  exhibiting  the  semigroup  as  a  con- 
crete system.  We  show  that  if  the  semigroup  has  word  problem  in 
s"^  (and  not  lower)  ,  then  the  resulting  group  has  word  problem 
in  fi"^  (and  not  lower)  .  Hence  there  exist  groups  with  arbitrarily 
difficult,  but  solvable  word  problem.s. 
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CHAPTER  1 
INTRODUCTION 

Given  a  presentation  of  a  group  (or  a  semigroup)  G,  we 
define  two  words  w  ,  w   on  the  generators  of  G  to  be  equal 
if  and  only  if  one  can  be  transformed  into  the  other  in  a 
finite  number  of  steps  using  the  relations  of  G,   The  word 
problem  is  the  problem  of  deciding  whether  two  arbitrary  words 
on  the  generators  of  G  are  equal,  or,  alternatively,  if  there 
exists  a  proof  of  "w   =  w  "  from  the  relations  of  G. 

Equating  truth  in  G  with  derivability  from  a  set  of 
axioms  suggests  the  possibility  of  formulating  the  word 
problem   for  G  as  the  derivability  problem  for  some  type  of 
formal  system.   The  first  part  of  the  paper  is  concerned  v;ith 
this.   Once  this  is  done,  we  can  consider  the  questions 
usually  asked  about  formal  systems,  such  as  consistency,  decid- 
ability, and  completeness,  and  use  results  about  particular 
formal  systems  to  obtain  results  about  groups  and  semigroups 
corresponding  to  these  systems.   In  this  analogy,  nontrivial 
groups  correspond  to  consistent  systems,  groups  with  solvable 
word  problem   to  decidable  systems,  and  simple  groups  to 
complete  systems.  The  last  analogy  is  particularly  striking, 
for,  strangely  enough,  although  it  has  been  known  for  a  long 
time  that  complete  recursively  axiomatized  theories  are 
decidable,  it  was  only  recently  noted  that  recursively 
presented  simple  groups  have  solvable  word  problems. 

The  major  part  of  the  paper   is  concerned  with  the  problem 
of  exhibiting  finitely  presented  groups  with  word  problems 
solvable  in  levels  &      of  the  Grzegorczyk  hierarchy. 

A  group  (or  semigroup)  has  word  problem  in  £   if  the 
running  time  of  the  algorithm  solving  the  word  problem  is  in  £  , 
The  group  used  to  prove  recursive  unsolvability  of  the  word 
problem  [Rotman,  1973]  is  constructed  from  a  presentation  of 
a  semigroup  computing  a  nonrecursive  function.  We  use  the  same 
technique  of  obtaining  a  group  presentation  from  a  semigroup 
presentation  and  show  that  if  the  semigroup  has  word  problem 
in  S   (n  >4) ,  then  the  resulting  group  has  word  problem  in  £  . 


2       3 
Groups  with  word  problems  m  £   and  £   are  exhibited.  Semi- 
groups are  constructed  following  Julia  Robinson's  method  of 
functional  equations  [J.  Robinson,  1968].   This  method  has 
the  advantage  of  exhibiting  a  semigroup  (or  a  group)  as  a 
concrete   structure,  namely,  as  a  semigroup  (or  a  group)  of 
transformations  of  certain  structures.  This  approach  has  been 
first  used  by  MacKenzie  and  Thompson  [1973],  and  then  consider- 
ably improved  by  Higman  [1974]  in  exhibiting  an  infinite 
family  of  finitely  presented  infinite  simple  groups. 

The  problem  of  exhibiting  finitely  presented  groups  with 
word  problem  in  £   has  been  looked  at  by  Cannonito  and 
Gatterdam  [1973]  (and  [Gatterdam,  1973]).   They  approached 
it  indirectly  by  showing  that  the  Higman  construction  of 
embedding  a  recursively  presented  group  in  a  finitely  presented 
group  preserves  £  -decidability,  and  by  exhibiting  a  recursive- 
ly presented  £  -decidable  group.   Their  notion  of 
£  -decidability  is  different  from  ours;  they  consider  a  group 
to  have  a  "realization"  in  the  natural  numbers,  and  the 
£  -decidability  of  a  group  is  defined  relative  to  a  fixed 
indexing  of  the  group. 


CHAPTER  2 
GROUPS  AND  SEM.IGROUPS  AS  DEDUCTIVE  SYSTEMS 

Simple  groups  behave  very  much  like  complete  theories. 
This  chapter  is  an   attempt  to  explain  why. 

A  result  due  to  Birkhoff  states  that  a  semigroup  with 
identity,  i.e.,  a  monoid,  is  isomorphic  to  the  semigroup  of 
all  endomorphisms  of  some  unary  algebra.   We  can  thus  view 
the  generators  of  a  semigroup  as  functions,  and  the  relations 
as  relations  between  functions.   Derivations  using  the  semi- 
group relations  consist  of  formal  substitutions;  no  reference 
is  made  to  the  domain  over  which  functions  are  interpreted, 
and  there  exists  the  possibility  that  there  are  several 
different  interpretations. 

A  deductive  system  of  the  type  suggested  above,  i.e., 
one  in  which  the  only  admissible  formulas  are  equations  and 
the  only  rule  of  inference  is  substitution  (of  equals  for 
equals  or  terms  for  variables)  is  called  an  equational    system. 
In  Section  2 . 1  we  present  A.  Tarski's  formulation  of  equa- 
tional logic.   We  show  that  with  each  presentation  of  a 
semigroup  we  can  associate  an  equational  system,  and  that 
derivations  in  the  semigroup  correspond  to  proofs  within  the 
system.   This  provides  a  link  between  logic  and  semigroup 
theory,  for  now  we  can  use  results  about  equational  theories 
to  obtain. results  about  semigroups. 

In  Section  2 . 3  we  examine  the  concepts  of  consistency 
and  completeness  applied  to  equational  theories  arising  from 
semigroup  presentations.  These  are  too  general  for  the  class 
of  equations  we  will  be  concerned  with,  and  new  definitions 
of  consistency  and  completeness  are  given,  relativized  to 
certain  sets  of  equations.   We  then  show  that  there  are  some 
interesting  analogies  between  results  from  logic  and  from 
group  and  semigroup  theory. 


2.1   Equational  Systems  —  Definitions  and  Main  Results 

An  algebra    is  a  system   A  =  <A, 0, , . . . , 0,  >  where  A  is  a 

nonempty  set  and  0^,...,0,       is  a  finite  sequence  of  finitary 

operations  on  A .   A  is  called  the  universe   and   (),...,  0, 

1 '      k 

the  fundamental    operations    of  A.   The  sequence  of  ranks  of 
0 ..,..., 0,     is  called  the  type   of  A.   A  fundamental  operation 
of  rank  0  is  identified  with  the  unique  element  which  consti- 
tutes its  range;  such  an  element  is  called  a  distinguished 
element   of  the  algebra.   If  all  the  operations  have  rank  one, 
then  A  is  called  a  unary    algebra. 

Below  are  some  examples  of  algebras;  to  avoid  parentheses 
we  will  put  an  operation  symbol  in  front  of  the  variables  to 
which  it  refers. 

(i)    Algebras  <A,'>   with  one  binary  operation  satisfying 
••X  y  z  =  'X'y  z   for  all  x,y,z   in  A 
All  sucn  algebras  are  called  semigroups . 
(ii)   A  group  is  an  algebra  <G, • ,    ,  1>   with  one  binary 
operation  •,  one  nullary  operation   1,  and  one  unary 
operation     satisfying: 


(1) 

•  •    X   y    z    =    'X'y    z 

(2) 

•1     X     =      'X     1     =     X 

(3) 

•X    x~       =     -x"       X    =    1 

for  all  elements  x,y,z  in  G. 

(iii)  Let  0^,...,0,     be  a  sequence  of  operations  and  let 

X  -    {x. |i  e  l}.   The  set  of  all   possible  terms  that 

can  be  built  up  from  X  and  C  ,  ,  .  .  , 0,  is  an  algebra. 

For  if  0    is  an  n-ary  operation  and  w, , , . . ,w   are  terms, 

J       c  In 

then  the  term  Ow^ . . .w    is  taken  to  be  the  result  of 

1     n 

applying  0    to  w^  ,  .  .  .  ,v/.  .   The  algebra  so  constructed 
is  called  a  free   algeora   and  X  is  called  a  free 
generatinp    set. 


Many  properties  of  algebras  can  be  expressed  by   means 
of  equations,  i.e.  formulas  of  the  form  a  =  3  where  a  and  B 
are  terms  of  the  algebra  being  discussed.   The  study  of 
abstract  algebras   and  the  equations  which  hold  in  theiri 
was  initiated  by  G.  Dirkhoff  [1335].   He  established  the 
rudiments  of  a  formal  system,  nov;  known  as  equational    logic, 
in  which  equations  are  the  only  admitted  formulas .  There  have 
been  more  recent  formulations  of  equational  logic  since  then, 
tilenberg  and  vJright  [1967]  gave  a  formulation  in  terms  of 
category  theory.   We  present  here  A.  Tarski's  [1963]  formula- 
tion of  equational  logic  as  part  of  predicate  logic;  equations 

"a  =  B"  with  variables  x,,...,x   are  treated  as  if  they  were 

In  -^ 

sentences  Vx^ , . . . ,x   (a  =  3)  . 

1      n 

The  underlying  language  L  consists  of  an  infinite  numioer 
of  variables,  a  finite  number  of  function  symbols,  and  =,  the 
symbol  for  equality.   Terms  and  formulas  are  defined  as  in 
first  order  logic.   An  equation    is  a  formula  of  the  form  a  =  3, 
where  a  and  3  are  terms.   The  equation  "x  -   x"  is  always 
included  among  the  axioms,  and  the  rules  of  inference  are 
(i)    substituting  terms  for  variables,  and 
(ii)   replacing  equals  by  equals. 

Let  a  =<a^,...,a,  >  be  the  sequence  of  operation  symbols 

1         K 

and  Z  a  set  of  equations  over  these  symbols.   The  models  of  Z 

are  algebras  <A, 0, , . . . , 0,  >  such  that  for  each  i,  0.    has  the  same 
^  Ik  1 

rank  as  a.  ,  and  which  satisfy  Z.   We  say  that  a  =  3  is  a 
logical    consequence      of  E  if  a  =  3  is  valid  in  every  model 
of   Z,  and  a  =  3  is  derivable    from  Z  if   there  exists  a  proof 
of  a  =^  3.   The  completeness  theorem  for  equational  logic, 
proved  by  Birkhoff  [1935]  states  that  the  notions  of  logical 
consequence  and  derivability  are  equivalent. 

By  an  equational    theory    6    we  mean  a  set  of  equations 
closed  under  logical  consequence.   A  class  X  of  algebras  is 
called  an  equational    class    if  K  is  the  class  of  all  models 
of  some  set  Z  of  equations .   We  say  that  K  is  the  equational 
class  defined  by  Z. 


Birkhoff  [1935]  has  given  a  purely  algebraic  character- 
ization of  equational  classes:   a  class  K  is  equational  if 
and  only  if  all  subalgebras,  direct  products,  and  homomorphic 
images  of  algebras  in  K  are  in  K.   We  note  here  that  Eilenberg 
and  Schutzenberger  [1976]  have  studied  what  happens  to  this 
theorem  if  one  restricts  one's  attention  to  finite  algebras 
only . 

Contrary  to  the  case  in  first  order  logic,  where  there 
exist   sets  of  sentences  with  no  models,  every  set  of  equations 
Z  always  has  models  —  the  free  algebras,   obtained  from  free 
algebras  of  terms  by  equating  terms  using  the  equations  of  Z. 
These  may  be  trivial,  consisting  of  only  one  element.   The 
free  algebras  satisfying  Z  can  be  thought  of  as  the  "most 
general"  algebras  satisfying  Z,  for  every  algebra   generated 
by  m  elements  and  satisfying  Z  is  a  homomorphic  image  of  the 
free  algebra  on  m  generators  [Birkhoff,  1935] . 


2.2   Associating  an  Equational  System  with  a  Semigroup 

Let  S  be  a  finitely  generated  semigroup  with  generators 

s,,...,s,  and  relations  a.  =  g.  ,  i  g  I ,  an  index  set, 
Ik  1     1  '       ' 

denoted   S  =  (s,,...,s,  la.  =  3-/  i^  I).   Leta  and  3  be  words 

1      k '  1     1 

on  the   generators  of  S .   We  say  that  a  =  B  is  derivable    from 

S  if  there  exists  a  sequence  of  words  y,,...,y   such  that 

^  '  1      '  n 

Yt  =  a  ,  Y   =3  and  y  ■  , -,    is  obtained  from  y.  by  using  a  rela- 
1       '  n  1  +  1  '  1   -^      ^ 

tion  of  S,  and  we  write  S  \ —  a  =  3ifa  =  3is  derivable  from 
the  relations  of  S.   Consider  the  equational  system  with 
function  symbols   s,,...,s,  and  axiom  set 
Z   =  {a . (x)  =  3. (x) |i  G  I}.   We  write  Z   |—  a (x)  =  B(x)  if 

O         X  J-  ^ 

a (x)  =  3(x)  is  provable  from  Z  . 

Theorem  2.2.1    Let  a , 3   be  any  two  words  on  the  generators 
of  S,   Then  S  |—  a  =  3**Zg  (—  a  (x)  =  3(x)  . 

Proof:  (=*)   Suppose  S  j —  a  =  3. 


Then  there  exists  a  sequence  of  words  Yw'-'fYj_  r    where  Y-,  is 

a,  Y   is  B  and  y  •  , ->  is  obtained  from  y-  using  a  relation  in  S 
m            1  +  1  1 

Thus,  Yi 


^2  = 


=  Y  •   The  following  sequence  is  a  proof 


of  a (x)  =  B (x)  from  Z. 


Eg  ^  X  ^  X 


E   j —  a (x)  =  a (x)    (Substituting  term  for  variable) 


Z   |—  a  (x)  =  Y2  (^) 


Zg  f-  a(x)  =  Y3(x) 


Z   |—  a(x)  =Y  _i  (x)  )■  (Substituting  equals  for  equals) 


Zg  K  a(x)  ^   e(x; 


(^)    Suppose  Z   |—  a(x)  =  B  (x)  .   To  show  S  |—  a  =  B ,  we 
use  induction  on  the  length  of  the  proof  from  Z  .   If 
a (x)  =  B(x)  is  an  axiom  (by  assumption,  it  is  not  "x  =  x"), 
then   S  f—  a  =  B. 

Suppose  a (x)  =  B(x)  is  obtained  from  a'(x)  =  B'(x)  by 
substituting  a  term  y(x)  for  a  variable  x,  and  S  | —  a'  -    B'. 

Then  Z^  |—  o'yCx)  =  B'y(x). 

Since  S  [—  a  '  =  B',   S  f—  a  '  y  =  6'Y- 
Suppose  a (x)  =  B(x)  is  obtained  from  a ' (x)  =  B'(x)  by  replac- 
ing  equals  by  equals.   Without  loss  of  generality,  we  can 
assume  that  the  equality  was  one  of  the  axioms.   Then,  it  is 
easily  seen  that  S  ) —  a  =  a'  and  S  | —  B'  =  B.   By  hypothesis, 
S  I —  a'  =  B',  and  putting  these  proofs  together,  we  have 
s  H  ct  =  B.  □ 

Remark  2.2.2    Note  that  in  Z   there  are  no  equations  of  the 


form  a (x) 


However,  if  S  contains  an  identity  e,  we  can 


consider  the  system  Z   with  the  additional  equation  e(x)  -   x 

and  the  above  theorem  still  holds. 
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This  theorem  sets  up  the  correspondence  between  deriva- 
tions in  semigroup  S  and  proofs  from  Z  .   Whenever  we  speak 
of  a  set  of  equations  Z   ,  we  will  assume  that  the  underlying 
equational  system  contains  functions  for  all  the  generators 
of  S  (for  it  is  possible  that  not  all  the  generators  appear 
in  the  set  of  defining  relations). 

For  the  rest  of  this  section  let  Z  be  a  set  of  equations 
between  unary  functions  containing  the  same    variable    on  both 
right  and  left-hand  sides.  We  wish  to  look  at  the  models  of  Z. 

First  of  all,  the  free  algebras  are  models  of  Z. 

Proposition  2.2.3    Z  always  has  a  nontrivial  model,  i.e.,  a 
model  consisting  of  more  than  one  element. 

Proof:   Consider  the  free  algebra  on  n  generators  (n  >^  2)  for 
Z  together  with  the  equations  g (x)  =  x  for  g   a  function  symbol 
in  Z.   This  algebra  contains  exactly  n  elements.  p. 

Proposition  2.2.4    Z  contains  models  of  every  cardinality. 

Proof :   If  all  the  functions  are  interpreted  to  be  the  identity, 
then  we  can  take  an  arbitrary  set  to  be  a  model  of  Z .     j-. 

Remark  2.2.5    In  Chapter  4 ,  we  will  present  systems  of  func- 
tional equations  where  the  intended  domain  of  interpretation 
is  W,   the  natural  numbers. 

Remark  2.2.6    Let  G  be  a  group;  suppose  that  a  presentation 
for  G  contains  the  relations  gg    =  e  for  g  a  generator  of  G. 
Let  Z   be  the  equational  system  associated  with  G  and  suppose 
that  Z   contains  the  equation  e(x)  =  x.   By  Cayley's  theorem, 

G 

G  itself  is  a  model  of  Z  :   we  can  think  of  G  as  a  group  of 

G 

permutations  (unary  functions),  and  the  relations  of  G  as 
relations  between  functions. 

From  Propositions  2.2.3  and  2.2.4,  we  see  that  a  set  of 
equations  Z  containing  g (x)  =  x  for  all  functions  g  has  non- 
trivial  models.   Such  interpretations  are  always  possible. 


but  they  are  rather  uninteresting  and  it  is  reasonable  to 
consider  these  models  as  in  some  sense  "trivial".   It  is 
interesting  to  note  here  that  a  group  is  defined  to  be  trivial 
if  w  =  e  for  all  elements  w;   regarding  the  elements  as 
permutations,  this  means  that  the  only  possible  permutation 
(consistent  with  the  relations)  is  the  identity  permutation. 
This  further  suggests  that  we  consider  those  models  of  Z 
"trivial"  in  which  all  functions  are  interpreted  to  be  the 
identity.   This  is  done  in  the  next  section. 


2.3   Consistency,  Completeness,  Decidability  -  Restricted 
Definitions 

Definition  2.3.1   (Tarski  [1968])    An  equational  theory  6 
is  called  aonsistent    if  it  does  not  contain  all  equations 
(with  given  operation  symbols)  or,  equivalently ,  if  it  does 
not  contain  the  equation  x  =  y. 

It  immediately  follows  that  6  is  consistent  if  and  only 
if  6  has  a  nontrivial  model. 

Definition  2.3.2   (Tarski  [1968])    An  equational  theory  9 
is  called  complete    if  there  is  no  consistent  theory  which 
properly  includes  6.   Or,  equivalently,  9  is  complete  if  for 
all  equations  a  =  3  with  given  operation  symbols,  either 
9  I —  a  =  3  or  9  u  {a  =  B)   is  inconsistent. 

Remark  2.3.3    The  following  theorems  from  predicate  logic 
extend  to  equational  theories : 

(i)   (Lindenbaum' s  theorem)    Every  consistent  theory  has  a 
consistent,  complete  extension. 

(ii)  Complete,  recursively  axiomatized  theories  are  decidable, 

We  will  now  apply  these  concepts  to  unary  equational 
theories  containing  only  equations  involving  one  variable 
(for  the  theories  arising  from  semigroup  presentations  are 
of  this  type) .   Let  9  be  such  a  theory. 


Proposition  2.3.4    G  is  consistent. 

Proof:   X  =  y,  being  an  equation  in  two  variables,  cannot  be 
a  logical  consequence   of  equations  of  one  variable.     r-. 

We  remark  that  in  Proposition  2.2.3  we  showed  that  6 
always  has  a  nontrivial  model,  giving  another  proof  that  6 
is  consistent. 

Proposition  2.3.5    9  is  complete  <>•  6  contains  all  equations 
involving  one  variable. 

Proof:   (=>■)   Suppose  not  all  equations  of  one  variable  are 
contained  in  0,  e.g.  a (x)  =  6(x)  ^  9.   Then  by  2.3.4, 
neither  is  9  u  {a  (x)  =  B(x)}  inconsistent,  i.e.,  9  is  not 
complete . 

(*=)   Suppose   9   contains  all  equations  of  one  variable. 
Let  a (x)  =  B(y)  be  an  arbitrary  equation.  Then  a (x)  =  x  and 
B(y)  =  y  are  in  9  by  assumption.   Therefore, 
6  u  {a(x)  =  B(y)}  |—  X  =  y,  i.e.,  9  u  {a  (x)  =  6(y)}   is 
inconsistent.   Therefore  0  is  complete.  p. 

Thus  we  cannot  make  distinctions  between  different  theories 
6  based  on  consistency  and  completeness.   This  happens  because 
the  set  of  equations   relative  to  which  consistency  is  defined 
is  in  a  sense  "too  large";  it  includes  equations  a(x)  =  6(y) 
(two  variables),  and  for  the  theories  9,  no  equations  involving 
two  variables  can  ever  be  proved.   This  suggests  restricting 
the  set  of  "all  equations"  to  "all  equations  of  one  variable", 
and  then  defining   consistency  and  completeness  relative  to 
this  set  of  equations.   To  emphasize  the  fact  that  only  equa- 
tions of  one  variable  are  involved,  we  call  these  notions 
1-consistency  and  1-completeness .   Let  9  be,  as  above,  a  unary 
equational  theory  containing  only  equations  of  one  variable. 

Definition  2.3.6    9  is  1-consistent      if  it  does  not  contain 
all  equations  of  one  variable  (with  given  operation  symbols) 
or,  equivalently ,  if  it  does  not  contain  the  equations   f(x)  =  x 
for  all  functions  f. 
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It  follows  from  the  completeness  theorem  for  equational 
logic  that  6  is  1-consistent  if  and  only  if  there  exists  a 
model  of  9  in  which  not  all  functions  are  the  identity. 

A  model  can  be  thought  of  as  being  trivial  if  it  is  a 
model  of  an  "inconsistent"  set  of  equations.   The  "trivial" 
models  in  this  sense  are  the  ones  in  which  all  functions  are 
the  identity.   Thus,  a  system  of  equations  may    have  models 
which  are  nontrivial  in  the  general  sense,  i.e.,  consisting 
of  more  than  one  element,  and  trivial  in  this  sense. 

Definition  2.3.7    6  is  1-aomplete      if  there  is  no  1-consistent 
theory  which  properly  includes  0,  or,  equivalently ,  if  for 
all  equations  a  =  B  of  one  variable,  either  a  =  3  e  6  or 
6  u  {a  =  3}   is  1-inconsistent . 

Definition  2.3.8    0  is  1-decidable    if  all  equations  of  one 
variable  are  decidable. 

Note  that  if  9  is  1-decidable,  then  0  is  in  fact  decidable, 
for  no  equation  a (x)  -    B(y)  involving  two  variables  is  prov- 
able . 

The  corresponding  analogues  of  theorems  given  in  2.3.3 
hold  with  the  restricted  notions  of  consistency  and  complete- 
ness : 

Proposition  2.3.9    A  1-complete  recursively  axiomatized  theory 
0  is  decidable. 

Proof:   Let  a  =  3  be  an  arbitrary   equation. 

(i)   If   a  =  3   is  an  equation  of  two  variables,  then  a  =  3  is 
not  provable. 

(ii)  If  a  =  3   is   an  equation  of  one  variable,  then  either 
0  I —  a  =  3  or  0  u  {a  =  3}   is  1-inconsistent,  and 
1-inconsistency  can  be  checked  using  2.3.6.         p. 

Proposition  2.3.10   (Lindenbaum ' s  theorem)    A  1-consistent 
theory  0  has  a  1-consistent,  1-complete  extension. 
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Proof:   Given  9,  let  A  =  {T  |  T  is  a  set  of  equations  containing 
only  one  variable  and  9  u  r   is  1-consistent } .   A  is  inductive, 
i.e.,  every  chain  has  an  upper  bound.   By  Zorn ' s  lemma,  A  has  a 
maximal  element,   A.  9  u  A  is  1-complete.  q 


2.4   Logic,  Groups,  and  Semigroups 

This  section  connects  ideas  from  logic  with  semigroups  and 
groups.  Throughout   this  section,  S  will  denote  a  semigroup 
with   identity,  G  a  group,  and  9   and  9    the  equational 
theories  associated  with  S  and  G,  respectively.  We  assume 
that  S  and  G  are  finitely  generated  and  presentations  are 
given,  and  that  9„  and  9„  contain  the  equation  e (x)  =  x, 

b  G 

where   e  is  the  identity.   The  semigroup  (or  group)  consisting 
of  only  the  identity  element  is  called  trivial. 

The  following  follows  from  definitions  and  2.2.1: 

Proposition  2.4.1   (i)   9   is  1-consistent  «■  S  is  nontrivial. 

(ii)  S  has  solvable  word  problem  <>  9   is  decidable. 

w  ,W2 
Let  w,  and  ^^   be  words  of  S.   S       is  the  semigroup  whose 

whose  relations  are  those  of  S   together  with  w   =  '^2  • 

Definition  2.4.2    S  is  simple    if  for  any  two  words   w, ,  w^ 

w, ,Wt  .     ^       . 

such  that  w,  7-^  w  ,   S  -■-   ^   is  trivial   (i.e.,  S  is  a  semigroup 

with  only  two  congruences:  the  identity,  modulo  which  it 

remains  unchanged,   and  the  universal  congruence,  modulo  which 

it  becomes  trivial.   This  is  also  called  aongvuenae    simplicity)  . 

Corollary  2.4.3    Let  g  ,g2,...,g^   be  the  generators  of  S. 

Then  S  is  simple  if  for  any  two  words   w  ,w  with  w   =  w_  the 

wt,W2  l      ^  1     ^ 

following  holds  in  S       :  g   =  g2  =  • . .  =  g^  =  e  . 

Proposition  2.4.4    S  is  simple  *>  9   is  1-complete. 
Proof :   S  is  simple  **  for  all  words  w  ,w   in  S,  either 


S  I —  w,  =  w   or   S  u  {w   =  w-}   is  trivial 
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e   is  l-complete  **  for  all  equations  a  =  6  involving 
one  variable  either   9   j —  a  =  B  or  9   u  {a  =  3}   is 
1-inconsistent . 

If  either  of  a  or  3  is  "x"  (suppose  a  is  "x") ,  then 
Q      \ —  X  =  6  ■»  9   I —  e(x)  =3.   Thus,  it  suffices  to 
consider  equations  a  =  3  in  which  neither  a  nor  3  is  "x" . 
The  result  follows  from  2.2.1.  p. 

Proposition  2.4.5    Finitely  generated  recursively  presented 
simple  semigroups  have  solvable  word  problem. 

Proof:   Let  S  be  a  finitely   generated  recursively  presented 
simple  semigroup  (with  identity).   By  2.4.4  9   is  l-complete. 
9   is  recursively  axiomatized,  and  so  by  2.3.9   ^c  is  decid- 
able.   By  2.2.1,  a  decision  procedure  for  8   gives  a  decision 
procedure  for  S .  g 

Let  G  be  a  finitely  generated  group.   A  maximal    normal 
subgroup    is  a  subgroup  N  of  G  such  that  G/N  is  simple.  A 
presentation  of  G/N  consists  of  the  relations  of  G  together 

with  w  =  e  for  w  6  N.  ' 

Proposition  2.4.6    Every  finitely  generated  group  has  a 
maximal  normal  subgroup. 

Proof:   In  view  of  the  above  remarks,  the  proposition  says 
that  any  finitely  generated  group  can  be  made  simple  by  add- 
ing relations.   But  by  2.2.1  this  is  Proposition  2.3.10 
for  9g.  a 

Simple  groups,  then,  do  act  like  complete  theories,  and 
enumerating  a  maximal  normal  subgroup  is  analogous  to  complet- 
ing a  theory.   We  note  that  for  first  order  theories,  if  we 
start  with  a  consistent  decidable  theory,  then  we  can 
effectively    obtain  a  consistent  complete  extension.  The  proof 
depends  on  the  fact  that  for  T  a  first  order  theory  and 
{A, , . . . ,A  }  a  finite  set  of  first  order  sentences. 
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T  u  {A,,..., A  ]  is  decidable  (for  any  first  order  sentence 
In  '' 

B,  T  U  {A,  ,  .  .  .  ,A  }  ]—  B  *»  T  k-  A,  &  ...  &  A   ^  B)  .   This 
in'         '1  n 

does  not  carry  over  to  equational  theories.   It  can  be  seen 
by  observing  that   for  a  group  G,  G  has  solvable  word  problem 
does  not    imply  that  G  u  A  has  solvable  word  problem,  for  A 
an  arbitrary  finite  set  of  relations  on  the  generators  of  G. 


2.5   Finite  Theories  and  Residually  Finite  Groups 

Residually  finite  groups  behave  similarly  to  finite 
equational  theories  in  the  following  sense:   if  G  is  a 
residually  finite  group,  w  a  word  on  the  generators  of  G, 
and  w  7^  1  in  G,  then  w  7^  1  in  some  finite  homomorphic  image 
of  G;  if  9  is  a  finite  equational  theory,  a  =  B  an  equation 
involving  the  function  symbols  of  6  ,  and  a  7^  B  in  6  ,  then 
there  is  a  finite  model  of  6  in  which  a  7^  B .   Now,  any 
homomorphic  image  of  G  satisfies  the  relations  of  G.  So, 
in  both  cases  an  equation  can  be  disproved  by  considering 
finite  models.   This  is  further  explored  in  this  section. 

Definition  2.5.1    For  an  equational  theory  0,  F0  is  the 
equational  theory  whose  valid  equations  are  just  those  in 
the  notation  of  G  which  hold  in  every  finite  model  of  6. 

An  equational  theory  6  such  that  6  =  F9  is  called  a 
finite    equational    theory.       Note  that  F6  is  consistent  if 
and  only  if  G  has  a  nontrivial  finite  model.  (These  defini- 
tions are  restrictions  to  equational  theories  of  the  ones 
for  first  order  theories  given  in  Trahtenbrot  [1950]). 

Clearly  logical  validity  implies  finite  validity.  To 
show  that  the  converse  does  not  hold,  we  first  need  the 
following  definition. 

Definition  2.5.2    Let  C  be  a  set  of  algebras.  K  is  generated 

by   C   if  every  member  of  K  is  obtained  by  taking  direct 

products,  homomorphic  images,  and   subalgebras  of  algebras 
in  C. 
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Note  that  by  Birkhoff s  theorem  [1935]  K  is  an  equational 
class.   Therefore  there  exists  a  set  of  equations  Z  such  that 
K  is  the  set  of  all  models  of  Z. 

To  show  that  finite  validity  does  not  imply  logical 
validity  consider  the  class  K  generated  by  an  infinite  cyclic 
group  and  a  finite  cyclic  group  of  order  k.   Clearly,  every 
finite  algebra  in  K  satisfies  x*x'...'x  =  1  (k  times),  but 
this  equation  is  not  logically  valid. 

Now  suppose  that  K  is  generated  by  a  set  C  of  finite 
algebras.   Then  Th(K),  the  set  of  equations  satisfied  by 
every  algebra  in  K  is  a  finite  theory.   (In  fact,  Th(K)  =  ThCC).] 
For  every  algebra  in  K  is  obtained  from  algebras  in  C  via 
certain  operations  and  if  a  =  6   is  true  in  every  finite  algebra 
in  K,  then  it  is  true  in  every  algebra  in  C,  and  therefore  true 
in  every  algebra  obtained  from  algebras  in  C  by  these  operations 
Furthermore,  if  C  contains  only  a  finite  number  of  algebras, 
then  Th(C)  is  effectively  enumerable,  for  we  can  effectively 
verify  if  an  equation  holds  for  all  algebras  of  C. 

Proposition  2.5.3    If  0  =  FO  and  6  is  finitely  axiomatizable, 
then  9  is  decidable. 

Proof:   If  6  =  FG,  then  for  any  equation  a  =  6,  either  Q     \ —  a  =3 
or  there  exists  a  finite  model  in  which  9  is  false.  Suppose 
9  is  finitely  axiomatized  by  the  set  of  equations  Z.  Then  we 
can  effectively  enumerate  everything  provable  from  Z.  Because 
Z  is  finite,  we  can  enumerate  all  finite  algebras,  check  if 
they  are  models  of  Z,  check  if  a  =  3  holds,  and  so   enumerate 
the  set  of  equations  which  fail  to  hold  in  some  finite  model. 
Since  9  =  F9 ,  every  equation  a  =  3  must  appear  on  one  of  these 
two  lists.  g 

We  can  get  equational  theories  as  in  the  proposition  by 
picking  for  the  set  C  above  a  finite  algebra  such  that  the 
set  of  equations  valid  in  C  is  finitely  axiomatizable.  A 
particular  example  is  a  finite  group  (Gates  and  Powell  [1964]). 
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Definition  2.5.4  A  group  G  is  residually  finite  if  for  each 
w  7^  1  in  G,  there  exists  a  normal  subgroup  N  such  that  G/N  is 
finite  and  w  7^  1  in  G/N. 

Proposition  2.5.5    Let  G  be  a  residually  finite  group.  Then 
6p  is  a  finite  equational  theory. 


Proof:   By  2.2.6,  G  is  a  model  of  0^.   Let  a(x)  =  3(x)  be  an 

G 

equation  on  the  function  symbols  of  6  .   By  2.2.1 

G 

e^    \—  a(x)    =    6(x)    «>G|—  a    =    6.       IfGf/a    =    6,    then   there 

exists  a  finite  homomorphic  image  of  G  in  which  a  7^  6 .  But 

this  homomorphic  image  is  also  a  model  of  6  .   Hence  there 

G 

exists  a  finite  model  of  9„  in  which  a(x)  ^    3(x)  and  so  G^ 

G  Ij 

is  a  finite  equational  theory.  p. 

Proposition  2.5.6    Finitely  presented  residually  finite 
groups  have  solvable  word  problem. 

Proof:   By  2.5.5   the  equational  theory  9^  associated  with 

G 

a  residually  finite  group  G  is  a  finite  theory.  By  2.5.3 
6^  is  decidable .  n 
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CHAPTER  3 
COMPUTATIONAL  COMPLEXITY  OF  WORD  PROBLEMS 

The  Grzegorczyk  hierarchy  of  functions,  defined  by 

A.  Grzegorczyk  [1953]  is  composed  of  an  infinite  sequence 

0     12 
of  classes   £   eg   eg   c  . . . ,  each  propertly  contained 

in  the  next  and  having  as  union  the  class  of  all  primitive 

recursive  functions.   Each  class  can  be  inductively  defined 

as  the  smallest  class  of  functions  containing  certain  initial 

functions  and  closed  under  the  operations  of  substitution  and 

bounded  primitive  recursion.   Using  the  limited  form  of 

recursion  has  the  effect  of  restricting  the  growth  of  the 

functions  within  each  class:   a  function  obtained  by  recursion 

from  functions  in  £   can  be  in  the  class  S   only  if  it  is 

bounded  by  a  function  already  in  the  class.   A  new  function, 

dominating  all  functions  in  £   is  obtained  by  unbounded 

recursion  from  functions  in  £  .   This  function  is  used  as 

an  initial  function   for  £ 

The  Grzegorczyk  hierarchy  is  particularly  interesting 
because  it  relates  the  computational  difficulty  of  a  function 
with  its  location  in  the  hierarchy.   A  function  can  be  shown 
to  lie  in  fi"^  (n  ^  3)  by  showing  that  the  amount  of  space  or 
the  amount  of  time  needed  to  co-npute  it  is  a  function  in  £  . 
This  criterion  is  machine  independent  in  the  sense  that  the 
amount  of  space  or  the  amount  of  time  can  be  measured  relative 
to  a  large  number  of  different  computing  machines  and  the 
result  is  the  same,  showing  that  the  computational  difficulty 
of  a   function  is  a  quality  intrinsic  to  the  function,  not 
depending  on  characteristics  of  specific  machines. 

Other  work  in  categorizing  functions  with  respect  to 
computational  difficulty  has  been  done  by  Cobham  [1964]  and 
Ritchie  [1963],   Ritchie's  hierarchy,  based  on  space  require- 
ments, is  a  refinement  of  the  class  of  elementary  functions. 
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We  say  that  a  group  G  has  word  problem  in  S   if  there 
exists  a  presentation  II   for  G  and  an  algorithm  solving  the 
word  problem  whose  computational  complexity  is  in  a'^,  i.e., 
the  amount  of  time  or  the  amount  of  space  used  by  the  algo- 
rithm is  an  £  -function  of  the  length  of  the  input.  We  show 
that  for  finitely  generated  groups,  having  word  problem  in  £^ 
is  a  property  of  the  group,  and  not  of  the  particular  presenta- 
tion of  the  group.   The  word  problem  for  free  groups  is  shown 

2 

to  be  m  S  ,  and  the  direct  product  of  G,  and  G„  is  shown  to 

have  word  problem  in  S   whenever  both  G,  and  G„  have  word 
problems  in  £  . 


3.1   The  Grzegorczyk  Hierarchy  -  Definitions  and  Main  Results 

Let  C  be  a  class  of  functions  from  W  to  W  (W  =  nonnegative 
integers) . 

Definition  3.1.1    C  is  closed   under    substitutions    if  C  is 
closed  under  the  following  three  operations: 

(i)    Composition  of  functions.   If  f  (x  ,  .  .  .  ,x,  _-,  ,  x,  ,x,  -,  ,  .  .  .x  ) 
and  g(yi,...,y  )  are  in  C,  then  so  is  their  composition 


f  (X-i/  •  •  •  '^]<_i''?'yi'  '  •  "  '  ^m  '^k  +  l'  *  *  *  '^n 


(ii)   Identification  of  variables.   If  f (x, , . . . ,x . , . . .x,  , . . .x  ) 

j_       J      K      n 

is  in  C,  then  so  is  the  function  obtained  by  replacing 
both  the  variables  x.  and  x,  by  y:   f (x,  ,  .  .  .y , . .  .y , . . .x  ) 

Here  y  is  a  variable  different  from  all  the  x.  's. 

^  1 

(iii)  Substitution  of  a  constant.  If  f (x, , . . . ,x,  , . . . , x  ) 
is  in  C,  then  so  is  the  function  f (x  , . . . , 0 , . . . , x  ) 
obtained  by  substituting   the  constant  0  for  x,  . 

Definition  3.1.2    Let  g,  h,  and  j  be  given  functions  and  let 
f  be  the  function  obtained  from  them  as  follows: 

f(u,0)  =  g(u)  (u  =  (u^,...,u^)) 

f(u,x+l)  -    h (u,x, f (u,x) ) 

f  (x,u)  <_  j  (x,u) 
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Then  f  is  said  to  be  defined  from  g,  h,  and  j  by  bounded 
primitive    recursion .       The  class  C  is  closed   under    bounded 
primitive    recursion    if  whenever  g,  h,  and  j  are  in  C, 
then  so  is  the  function  f   as  given  above. 

Definition  3.1.3    C   is  the  class  of  functions  of  n  argu- 

n 

ments  which  belong  to  the  class  C. 

Definition  3.1.4    The  function  F{x, ,...,x  ,t)  of  n+1 

1       n 

arguments  is  a  universal    function    for  the  class  C   provided 

that  for  each  function  f(xT,...,x  )  of  n  arguments  the 

1       n 

function  f  belongs  to   C   if  and  only  if  there  exists  a 

^    »    n  -' 

number  t  such  that  for  each  x, , . . . ,x  ,  f (x, , . . . ,x  ) 

1      n      1      n 

1      n 

To  define  the  Grzegorczyk  hierarchy,  we  first  present 
the  following  functions: 


fo(X'Y)   -  y  +  1 


For  n  >  2 , 


f^  (x,y)   =   X  +  y 
f2(x,y)   =   (x+1) (y+1) 

f„+l(x+l,y)  =   f^^^(x,f^^^(x,y)) 
Definition  3.1.5    For  n  e  W,  the  class  £   (Grzegorczyk ' s 

+-  H 

n    class)  is  the  smallest  class  of  functions  C  such  that 
(i)    the  functions  x+1,  U, (x,y)  =  x,   U  (x,y)  =  y,  and 

f  (x,y)  are  in  C. 
(ii)   C  is  closed  under  substitutions  and  bounded  primitive 

recursion . 

Another  definition  of  Grzegorczyk ' s  hierarchy,  using  a 
different  set  of  initial  functions  and  the  operation  of 
limited  exponentiation  is  given  in  Meyer  and  Ritchie  [1967], 
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We  note  now  some  results  about  the  Grzegorczyk  hierarchy 
which  will  be  used  later. 

Theorem  3.1.6  (Grzegorczyk  [1953]) 

...    (?n  _  pn+1 
(i)    £   5.  ^ 

(ii)    u   S   =  P   (the  primitive  recursive  functions) 
n^O 

(iii)  S   is  the  class  of  elementary  functions 

(iv)   For  n  >^  3 ,  the  class  £     contains  the  universal 
function  for  the  class  £, 

(v)    The  function  f   , (x,x)  increases  faster  than  any 
function  of  the  class  £, 

(vi)   Every  recursively  enumerable  set  is  enumerated  by  a 
function  of  class  £ 

(vii)  For  every  function  f  e  £    (n  >^  2)  there  exists  k 

such  that   f (x)  <  f  ^, (k,x)  for  all  x 

n+l 

(viii)  f  ^,  (x,y)  >  y  for  n  >  1 
n+l  — 

(ix)  f^^^{x+l,Y)     >    fj^+i(X/y)   for  n  ^  0 

(x)    f  (x,y+l)  >  f  (x,y)  for  n  >  0  . 
n  n  — 

Let  T  be  a  deterministic   Turing ' machine  with  a  single 
two-way  infinite  tape,  a  finite  alphabet,  and  a  finite  number 
of  states.   T  can  execute  the  follov/ing  instructions:  erase 
a  symbol  and  print  an^^thjr  one  in  its  place,  move  one  tape 
square  to  the  left,  or  move  one  tape  square  to  the  right. 
T  computes  a  numerical  function  f(n)  by  starting  out  with 
n  encoded  on  its  tape  as  a  sequence  of  n+l  I's  (assuming  1  is 
in  the  alphabet  of  T) .   The  computation  terminates  when  T 
reaches  a  final  state  and  the  value  of  f(n)  is  the  number 
of  I's  on  the  tape.   We  can  define  two  functions: 
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t^{n)  =   the  number  of  steps  (instruction  executions)  in 
the  computation  of  T,  starting  with  n  encoded 
on  its  tape 

s^(n)  =   the  number  of  tape  squares  scanned  during  the 
course  of  computation. 

Theorem  3.1.7  (Cobham  [1964])    For  each  k  ^  3,  the  following 
five  statements  are  equivalent: 

(i)     f  e   &^ 

(ii)    There  exists  a  Turing  machine  T  which  computes  f  and 
such  that  t^  G  &^ 

(iii)   There  exists  a  Turing  machine  T  which  computes  f  and 
a  function  g  e  £   such  that  for  all  n,  t  (n)  <_  g(n) 

(iv)-(v)  same  as  (ii)-(iii)  with  s^^,  in  place  of  t  . 

This  theorem  is  also  true  for  wider  classes  of  computing 
machines;  it  holds  for  Turing  machines  with  more  than  one  tape 
or  with  multidimensional  tapes.   It  also  holds  if  the  set  of 
possible  instructions  is  extended  to  include,  for  example, 
erasure  of  an  entire  tape.   This  suggests  that  the  computational 
difficulty  of  a  function  is  an  intrinsic  property  of  the  func- 
tion . 

Definition  3.1.8    The  oharacteristic    function   of  a  set  S  is 
a  function  Xq  such  that 


We  say  that  a  set  S  is  in  s"  if  and  only  if  its  character- 
istic function  Xc  is.   Note  that  if  S  is  in  S  ,  then  so  is  its 

complement  S,  for  x   =  1  ~  Xc- 

S 

It  is  not  clear   from  the  way  the  Grzegorczyk  hierarchy  is 

defined  whether  there  exist   0-1  valued  functions  in  £    -S  , 
i.e.,  whether  there  exist   sets  in  a""*"  -s'^.  Requiring  functions 
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defined  by  primitive  recursion  to  be  bounded  by  a  function 
already  in  the  class  has  the  effect  of  restricting  the  growth 
of  the  functions.   Thus  the  growth  of  a  function  seems  to  be 
a  major  factor  in  determining  where  it  lies  in  the  Grzegorczyk 
hierarchy.   However,  we  have  the  following  result. 

Theorem  3.1.9     For  n  >^  3 ,  there  exist  0-1  valued  functions 
in  £"^^-  £^. 

Proof:   For  n  >  3,  let  \b     , ,  be  the  universal  function  for 

n  ~  n+1 

the  class  £,  (cf.  3.1.6  (iv) ) .   Define 

{1   if   4^   ,  (x,x)   even 
n+l 
0   if   i|;^_^^(x,x)   odd 

Clearly,  g  ^^  G  £    ,   If  g  , ^  e  e"    then  there  exists   x. 
■^       ^n+1  ^n+1  0 

such  that  g  _,  T  (x)  =  \p     , ,  (x.  ,x)  .  Then 
n+l       n+l   0 

(    1      if  i)     ,  ^   (x„,x-)  even 
/       \  ,  /  \  I  ^n+1   0   0 

^n+l^^O^  =  '^n+1^^0'^0^    1  n   -^   ,    ,  .  oHH 

I  °   ^^   "^n+l^^O'^O^  °^^ 

which  cannot  happen.   Hence  g  ,,  cannot  be  in  S  .        -i 

'^'^  ^n  +  1  U 

S  4.1  =  {J<  I  g  ^1  (x)  =0}   is  a  set  in   s""^"""  -  &^    . 
n+l       '   n+l 

It  is  not  known  whether  this  result  also  holds  for  n  <  3. 
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3.2   Iteration  and  the  Grzegorczyk  Hierarchy 

Definition  3.2.1    Let  h,  and  h_  be  given  functions  and  let 
g  be  the  function  obtained  from  them  as  follows: 

(i)  g(0,y)  =  h^(y) 

(ii)  g(x+l,y)  =  h2g(x,y) 

Then  g  is  said  to  be  defined  from  h,  and  h„  by  iteration. 
If  in  addition  to  (i)  and  (ii)  ,  g(x,y)  <_   h^(x,y)  where  h^ 
is  a  given  function,  then  g  is  said  to  be  defined  from  h, ,h„ 
and  h^  by  bounded   iteration . 

Let  j (x,y)  be  a  one-to-one  mapping  of  the  set  of  all 
pairs  of  natural  numbers  onto  the  set  of  natural  numbers. 
Let  k  and  I   be  the  inverse  functions:   kj(x,y)  =  x  and 
£j(x,y)  =  y.   Any  such  function  j  is  called  a  pairing   function, 
Examples  of  pairing  functions  are: 

(i)   (Cantor  pairing  function)   j(x,y)  =  ( (x+y+1)  (x+y))/2  +  x 

(ii)  (Exponential  pairing  function)   j(x,y)  =  (2x  +  1)2-^  -  1 
(J.  Robinson  [1967] ) . 

Note  that  any  pairing  function  j  has  to  be  at  least  quadratic, 

2 
I.e.,  £   is  the  lowest  level  of  the  Grzegorczyk  hierarchy  m 

which  a  pairing  function  can  occur. 

Theorem  3.2.2    If  f ,h  e  g"   (n  ^  2) ,  then  the  function  g 
defined  by 

g(0,y)  =  h(y) 

g(x+l,y)  =  fg(x,y) 

pn+1 
IS  in  £ 

Proof:   By  3.1.6  (vii),  f  e  £^  =>  there  exists  k   such  that 

f(y)  <  Vl^^l'^) 
Also  by  3.1.6  (vii),  h  e  g"  =>  there  exists  k-  such  that 

h(y)  <  f^^i(k2,y) 
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Let  k  =  max  {k,,k„}.   Then 

f(y)  <  f^^i(k,y) 
and 

h(y)  <  ^n+i^^'Y)    . 

We  will  show  by  induction  on  x  that 

g(x,y)  <  f^_^^(k+x,y)  (*) 

For  X  =  0,  g(0,y)  =  h(y)  <  f^_|_^(k,y). 
Suppose  the  inequality  (*)  holds  for  x.  Then 

g{x+l,y)  =  fg(x,y)  <  f^_^^  (k  ,g  (x,y )  )  <  f^^^  (k  ,  f^^^  (k+x,y )  ) 

^  ^n+l^^'^^'^n+1^^'^^'^^  ^  "  fn+l^*^"^^"^-^'^^  • 

D 

Remark  3.2.3    By  using  R.  Robinson's  technique  for  simplifying 
recursion  [194  7],  we  can  show  that  if  j ,k  and  I   are  added  to 
the  initial  functions  of  £   (i^  ^  level  of  j )  ,  then  bounded 
recursion  can  be  replaced  by  bounded  iteration  of  the  form 

g(0,y)   =  h^(y) 

g(x+l,y)=   h2g(x,y) 

g(x,y)   _<   i(x,y) 

in  the  definition  of  the  Grzegorczyk  hierarchy. 

We  will  now  show  that  for  n  ^  2,  the  functions  f  ^-i  can 
be  obtained  by  iteration  from  functions  in  £  .   For  n  >_  2 , 
we  define 

d(0,y)  =  y 

Cl^^^l'^)  =   fn^Cl^^'^^-^^'Cl^^'^)-^^) 

Lemma  3.2.4    For  n  ^  2,  f* ^^{x+1,y)    =    f^  +  i d ' ^ ^+1 ^^ '^^ ^  ' 
Proof;   f*^^(x+l,y)  =  f^  (f  *_^^  (x,y ) +1 ,  f^_^^  (x,y ) +1 )  and 

Cl^^'Cl^^'^))  ^  ^n^d^O'Cl^^^y))  -^1'  Cl^O'Cl^^'^)^-^^^ 
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Lemma  3.2.5    For  n  >  2,  f  ^+1  ^^  '  ^n+1  ^^ '^^  ^  "  ^n+1  ^^' ^n+1  ^"^ '^^  ^ 
Proof :   By  induction  on  x.   For   x  =  0, 

Suppose  the  lemma  holds  for  any  x.   Then 

Ci(i'd(-^i'^))''=''d(i'Ci(i'Ci(-'^))) 


hyp   *       *       *  3.2.4 

=   f„,T(l,f„^T{x,f„,,(l,y)))   =   f"  , (x+l,f„^, (l,y)). 


* 


D 


Lemma  3.2.6    For  x  >^  2,  f  ^+1  ^^l''"^2 '^^  ^  ^n+1  ^^1 '  ^n+1  ^^2 '^^  ^ 

Proof :   By  induction  on  x„ .   If  x„  =  0,  there  is  nothing 
to  prove.   If  x„  =  1,  this  is  Lemma  3.2.4.   Suppose  the 
lemma  holds  for  x_  >  1,  for  fixed  x, .   Then, 

*  hyp&3.2.4^       ^        ^ 

^n+l(^1^^2^1'^)       =      ^n+l(l'fn+l(^l'fn+l(^2'y^)) 

=   fn+l^^l'^n+l(l'^n+l(^2'y)^)   =   ^ n+1 ^^1 ' ^n+1 ^^2^^ '^ ^ ^ 

n 

Theorem  3.2.7     For  n  >  2,  f  ^.{x,y)    =    f  ^, (2^,y). 
—     n+1    -^      n+1     -^ 

Proof:   By  induction  on  x.   For  x  =  0: 

f*^i(2°,y)  =  f*^,  (l,y)  =  f  (f*^,  (0,y)+l,f*_^,  (0,y)+l) 
n+1    '^  n+1    -^      n   n+1    -^      n+1  '^ 

=   f^(y+i,y+i)  =  fji+l^^'^^ 
Suppose  the  theorem  holds  for  x.  Then 
Ci(2^-'\y)  =  f;^l(2.2^y)  =  f;^^(2^+2^y) 

3-2.6   ^        ^  hyp 

=    fn+l(2^fn+l(2^y))   =   f  ^+1  (-' ^n+1  ^^'^^  > 
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*  n+1     n 

Corollary  3.2.8    For  n  ^  2,  f^+i^^'V)  ^  ^     -  £  . 

Proof:   For  n  =  2,  it  can  be  shown  by  induction  on  x  that 


for  all  y. 


fj^+l(x,y)  >  y^  e  £"  -  £• 


For  n  >  2,  if  f   ,  were  in  S  ,  then  so  would  f„+i (2  ,y) 

=  f   ,  (x,v)  ,   since  s"^  is  closed  under  composition.  But 
n+1 ^    ' ^  ' ' 

this  implies  that  f  ^n  ^  S   --  contradiction.  Thus, 


Now,  the  function  g (x)  =2   can  easily  be  defined  from 
the  functions  s (x)  =  x+1  and  f^(x,y)  by  bounded  iteration: 
Let  g, (x)  be  defined  by: 

g^(0)   =   0 

q^{K+l)    =    ssg^(x) 

g^(x)  <  f2(l,x)      (i.e.,  g^ (x)  -  2x) 

Let  g(x)  be  defined  by: 

g(0)   =   1 

g(x+l)  =  g-]^g(x) 


* 


,x 


g(x)  <_   f2(x,2).   Then,  g(x)  =  2  . 

*  n 

Hence,  for  n  >  3,  f   can  replace  f   in  the  definition  of  £  , 
—      n  n 

and  together  with  3.2.3  we  have: 

Proposition  3.2.9    For  n  ^  3,  s"  can  be  defined  as  the 
smallest  class  of  functions  containing  s (x)  =  x+1,  U^(x,y)  =  x, 
U-(x,y)  =  y,  f  (x,y),  j,  k,  £  and  closed  under  the  operations 
of  substitution  and  bounded  iteration. 


Proposition  3.2.10    The  functions  f^(x,y)  =  x+y  and 
f2(x,.y)  =  (x+1)  (y+1)   can  be  d( 
by  substitution  and  iteration. 


f„(x,.y)  =  (x+1)  (y+1)   can  be  defined  from  s  (x)  =  x+1,  j,  k,  & 
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Proof :   f, (x,y)  can  be  defined  by 

fj^{0,Y)    =  y 

f^(x+l,y)  =  l+f^(x,y) 

To  define  f  (x,y),  first  define  f': 

f2(0,y)  =  j  (0,y+l) 

f2(x+l,y)  =  j  (sk,f^(k,£) )f •  (x,y) 
Then, 

f2(x,y)  =  £f2(x,y)  .  g 

From  3.2.9  and  3.2.10  we  see  that  every  function  in  £ 
(n  >^  3)  can  be  defined  in  a  special  way: 

Proposition  3.2.11    Any  function  in  £   (n  >_  3)  can  be  defined 

from  s(x)  =  x+1,   U  (x,y)  =  x,   U  (x,y)  =  y,  j,  k,  and  I 

by  repeatedly  constructing  a  new  function  from  previously 

obtained  functions   by  substitutions  and  iteration. 

Furthermore,  unbounded  iteration  is  needed  only  to  define 

* 

the  initial  function  f  . 

n 


3.3   a'^-Decidable  Groups 

Definition  3.3.1    A  group  G  has    word   problem    in    S   if  there 
exists  a  presentation  11   for  G,  a  Turing  machine  T  and  a 
function  g  g  a"  such  that  for  any  word  w  on  the  generators 
of  G,  T  decides  in  <_   g(length(w))  steps  whether  or  not  w  =  1 
in  G.   A  similar  definition  holds  for  semigroups. 

Note  that  for  n  ^  3,  this  is  equivalent  to  using  "space" 
in  place  of  "time"  (3.1.7). 

The  word  problem   for  groups  is  treated  similarly  to 
the   recognizability  problem  for  formal  languages:  the  set 
of  words  of  G  which  equal  1  is  the  set  of  finite  strings  of 
symbols  accepted  by  a  Turing  machine  (Hopcroft  and  Ullman 

[1969] )  . 
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other  analogies  between  word  problems  for  groups  and 
recognizability  of  languages  have  been  exhibited.  G. 
Sacerdote  [1977]  defines  a  finitely  generated  group  to  be 
automatic    if  the  set  of  words  of  the  group  which  equal   1 
is  the  set  of  finite  strings  of  symbols  accepted  by  a 
finite  automaton.   It  has  been  shown  that  automatic  groups 
are  precisely  the  finite  groups.   An  algebraic  characteriza- 
tion for  context  free  groups  is  asked  for. 

We  say  that  G  is  S  -deaidable    if  G  has  word  problem 

on 
m  S  . 

Given  a  group  G  (and  a  presentation  11^)   and  an  element 
w  s  G,  we  can  try  to  see  if  w  =  1  in  G  by  using  the  relations 
of  G  to  transform  w  into  1.   If  w  is  in  fact  equal  to  the 
identity,  then  a  proof  will  be  found.   If  G  is  a  finitely 
presented  group   then  we  can  easily  construct  a  (nondetermin- 
istic)  Turing  machine  to  simulate  this  procedure.   Roughly, 
the  alphabet  of  the  Turing  machine  would  contain  the  generators 
of  G  and  it  would  work  by  locating  the  right  or  left  side  of 
a  relation  and  replacing  it.   To  estimate  the  number  of 
instructions  executed  in  applying  a  relation  to  w,  we  just 
note  that  to  do  so ,  a  portion  of  w  might  have  to  be  moved, 
depending  on  the  length  of  the  word  being  substituted.  The 
number  of  instructions  executed  in  applying  one  relation  is 
about  k« length (w)  and  the  length  of  the  resulting  word  is 
<_  k  •  length  (w),  where  k  is  a  constant  equal  to  the  number  of 
instructions  needed  to  move  one  letter  of  w.   By  induction  the 
number  of  instructions  executed  in  applying  n  relations  is 
<_  n-k  •  length  (w)  . 

If  we  have  a  bound  on  the  number  of  relations  that  need 
to  be  used  to  see  if  w  =  1,  then  the  procedure  is  effective. 
Suppose  that  we  need  <_  f  (length  (w))   relations  to  decide  if 
w  =  1.   Then  we  can  construct  a  Turing  machine  which  works 
by  first  computing  f (length (w))  and  then  like  the  one  described 
above,  with  the  extra  step  that  it  subtracts  1  from  f (length (w)) 
when  a  relation  is  used.   It  terminates  when  f (length (w)) 
becomes  0  or  w  gets  transformed  into  1. 
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We  can  also  construct  a  determinnstic  Turing  machine, 
not  much  more  complex  with  respect  to  time  and  space  require- 
ments than  the  nondeterministic  one  above,  to  solve  the  word 
problem  for  G.  W.  Savitch  [1970]  has  shown  that  a  nondeter- 
ministic Turing  machine  using  s (n)  tape  squares  on  an  input 

of  length  n   (s(n)  >  log  (n) )   can  be  simulated  by  a 

2 
deterministic  Turing  machine  using  at  most  (s(n))   tape 

squares.   It  can  easily  be  seen  that 

Remark  3.3.2    For  n  >^  3 ,  if  the  number  of  relations  used 
in  deciding  w  =  1  is  £  h (length (w) ) ,  h  g  S  ,  then  G  has 
word  problem  in  £^. 

Here  we  are  taking  into  consideration  that   the  length 
of  the  word  may  increase  in  applying  a  relation.   If  this 
does  not  happen,  then  this  result  holds  for  n  >_  2 . 

In  what  follows  we  try  to  locate  where  the  word  problem 
of  a  group  lies  with  respect  to  the  Grzegorczyk  hierarchy. 
The  number  of  relations  used  to  decide  if  w  =  1  (for  a 
finitely  presented  group)  is  a  convenient  way  of    measuring 
the  difficulty  of  the  word  problem.   We  remark  that  in 
counting  the  number  of  relations  used,  we  consider  the 
following  to  be  relations:   l*g.  =  g-,   q.-l  =  q.  , 
g.g.   =1,   g.  g.  =1,  where  g.  is  a  generator  of  G. 

Theorem  3.3.3   A  finitely  generated  free  group  has  word 

2 
problem  in  S  . 

Proof :   Let  G  be  a  free  group  on  n  generators.   It  is  easily 
seen  that  the  word  problem  for  G  is  equivalent  to  the  member- 
ship problem  for  the  context  free  language  consisting  of 
all  well-formed  expressions  over  n  pairs  of    matching  left 
and  right  parentheses,  where  parentheses  may  balance  on 
either  side.   It  has  been  shown  [Hopcroft  and  Ullman,  1969] 

that  the  number  of  steps  needed  to  recognize  a  context  free 

2 
language  is  in  £  .  p. 
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This  result  can  also  be  proved  by  noting  that  the  number 
of  relations  needed  to  show  w  =  1  in  a  free  group  is  £  length (w) 
and  using  3.2.2.   (The  length  of  the  word  never  increases  in 
the  course  of  a  derivation,) 

Let  G^  c  G2  ,  Gj 
Since  G,  is  embedded  in  G„  ,  for  each  generator   g.  of  G^ 

there  exists  a  word  w   on  the  generators  of  G„  (under  the 

^i 
embedding  isomorphism) .   For  v  a  word  m  G,  ,  let  w   be  the 

word  of  G„  obtained  by    replacing  g.  with  w_   and     g.   with 

w   .   Thus,  we  have  that  v  =  1  in  G,  ■*  w   =  1  in  G_.  Now 
g^  1    V         2 

if  the  length  of  the  longest  w   is  k,  then  the  length  of  w   is 

^  ^i  "^       V 

at  most  k • length (v).  We  can  conclude  the  following: 


(i)    If  Gy   has  word  problem  in  £   (n  >_  1)  ,  then  G   has  word 
problem  in  £   (if  the  number  of  steps  required  is 
g(length(u))  for  u  a  word  of  G„  ,  then  the  number  of 
steps  for  a  word  v  in  G,  is  g (k -length (v) )  ). 

(ii)   If  a  finitely   generated  group  G  has  word  problem  in  £ 
(n  >^  1)  for  some  finite  system  of  generators,  then  G 
has  word  problem  in  £   for  any  other  finite  system 
of  generators. 

(iii)  The  word  problem  for  a  finitely  generated  subgroup  of 

G  cannot  be   more  difficult  than  the  word  problem  for  G. 

Given  two  groups  A,  B  with  word  problems  in  £  ,  it  is 
frequently  interesting  to  see  how  difficult  is  the  word 
problem  for  the  group  G  obtained  from  A  and  B  by  using  some 
standard  construction  from  group  theory.   We  show  that  the 
word  problem  for  the  direct  product  of  A  and  B  is  not  more 
difficult  (i.e.,  higher  in  the  Grzegorczyk  hierarchy)  than 
the  more  difficult  of  A   and  B.   We  will  show  in  Chapter  5 
that  the  Britton  extension  of  a  group  A  can  have  word  problem 
substantially  harder  than  A. 

Definition  3.3.4    Let  K  =  (k^,...,k  |R)  and  Q  =  (q,  ,  . . . ,q  I S)  . 
Let  e  be  a  homomorphism   6  :  Q  ->  Aut(k)  defined  by  6  (k)  =  xkx 
for  all.k  e  K,  x  G  Q.   The  semidirect   product      K  x„  Q  is 
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the  set  of  all  ordered  pairs  (k,x)  s  k  x  Q  under  the  binary 
operation  (k^,x)(k2,y)  =  (k^G^ (k^ ) ,xy) . 

A  presentation  for  K  '.  Q  is  the  following: 


generators:   kw...,k  ,  q^,...,q 
1      n   ^1     ^m 

relations:    R,  S 

(*)        q^kj  =  w^^q^   ,   q^k^   -  ^ij'^i  '    i  =  l,...,m 

/**^        -lu         -1     -1,  -1    -1-1 

(**)       q.  k..=w..q.  ,   q.  k.  =w..q.  ,    1=1, ...,n 

(w. .  and  w. .  are  words  on  k, , . . . ,k  ) . 
13      -13  1'    '  n 

(Actually,  since  6    is  an  automorphism,  the  w. .  are  a  set 

of  generators  for  K  and  relations  (**)  can  be  obtained  from  (*).) 

We  note  the  following  facts: 

(i)    Every  element  of  K  x„  Q  can  be  expressed  as  kq , 
where  k  s  k,  q  e  Q. 

(ii)   Let  g  =  kq,  where  k  e  K,  q  e  Q.   Then  g  =  1  in  K  " a    Q 
if  and  only  if  k  =  1  and  q  =  1. 

Proposition  3.3.5    If  K  and  Q  have  word  problem  in  £  ,  n  >_  3 , 
then  K  X   Q  has  word  problem  in  £  . 

Proof:   To  decide  if  w  =  1,  we  put  w  into  normal  form  (i) 

and  then  use  the  algorithms  for  solving  the  word  problem 

for  K  and  Q  (by  (ii) )  . 

Let  r  be  the  maximum  length  of  any  w.  .  or  w. . .   To  put 

13     -ID 
w  into  normal  form,  scan  from  left  to  right,  moving  the  q's 

to  the  right.   Let  \w\    -    length  of  w.   The  length  of  the  word 

can  increase  by  r  each  time  a  relation  (*)  or  (**)  is  used; 

thus,  to  move  one  q.  over  to  the  right  can  require  at  most 

|w|  relations  and  the  length  of  the  resulting  word  is  ;<  r|w|. 

The  next  q.  has  to  be  moved  across  a  word   of  length  at  most 

r|w| ,  resulting  in  a  word  of  length  at  most  r  |w| .    By 

induction,  to  move  all  the  q's  to  the  right  requires  at  most 

|w|  r'  '     relations  and  the  length  of  the  resulting  word 

I  w  I  1   1 
IS   r '    w  . 
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Let  K  have  word  problem  solvable  in  _<  g(|w|)  steps,  g  g  fi  , 
Let  Q  have  word  problem  solvable  in  <  h(|w|)  steps,  h  e  £  , 


Th 


en  K  Xq  Q  has  word  problem  solvable  in 


£f(|w|)  -  |w| ^r '^'+  g( |w|r '"' )+h( |w|r '"' )  steps, 
and  f  is  in  £    for  n  ^  3  .  ^ 


Remark  3.3.6    If  r  =  1  in  the  above  proof,  then  the  proposi- 
tion holds  for  n  >  2. 
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CHAPTER  4 


S'^-DECIDABLE  SEMIGROUPS 


If  Z  is  a  system  of  functional  equations  with  function 
symbols  f^,...,f  ,  then  we  can  associate  with  Z  a  semigroup 
S^  ,  consisting  of  words  over  f,  ,...,f   with  the  operation 
of  juxtaposition,  and  whose  set  of  relations  is  E.   This 
provides  a  very  elegant  method  for  constructing  semigroups, 
for  we  have  a  concrete  interpretation  of  the  generators  as 
functions.   J.  Robinson  [1968]  showed  that  the  word  problem 
for  semigroups  is  recursively  unsolvable  by  using  a  system 
of  functional  equations  defining  a  nonrecursive  function. 
We  use  this  idea  to  construct  decidable  semigroups. 

However,  defining  recursive  functions  does  not  guarantee 
decidability  for  the  system  Z.   We  give  simple  examples  of 
equations  which  we  would  expect  to  be  derivable  from  Z ,  but 
which  are  not.   The  reason  for  this  is  that  we  think  of  the 
functions  in  Z  as  being  defined  over  the  natural  numbers,  but 
in  defining  derivability  from  Z,  we  are  considering  Z  to  be  a 
formal  system  without  reference  to  any  domain  of  interpreta- 
tion.  Thus,  systems  of  functional  equations  are  equational 
systems  (cf.  Chapter  2)  and  there  exist  domains  for  the 
functions  other  than  the  natural  numbers.   Some  examples  are 
given . 

A  system  of  functional  equations  Z,  however,  can  be  made 
decidable   by  adding  relations  to  obtain  a  new  system  Z   in 
such  a  way  that  the  functions  computed  by  Z   are  the  same  as 
those  computed  by  Z .   We  show  that  for  n  >^  4 ,  there  exist 
systems  of  functional  equations  Z  computing  functions  in  S 
and  such  that  Z  is  S  -decidable. 


n 
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4.1   Functional  Equations  -  Definitions  and  Properties 

In  this  section  we  define  and  give  the  main  properties 
of  systems  of  functional  equations,  as  presented  in  J.  Robinson 
[1968]  . 

Small  letters   m,n,t,x,y  will  refer  to  elements  of  N, 
all  others  to  functions  from  W  to  W . 

Definition  4.1.1    The  function  f  is  obtained  by  general 
recursion    from  functions  a,b,g,h  if 

(i)    f a  =  g ,  fb  =  hf,   and 

(ii)   every  natural  number  n  belongs  to   Range (b  a)  for  some 
m  >_  0  . 

From  (i)   we  have   fba(t)  =hg(t)  for  any  teW.   Ifa,b,g, 
and  h   are  computable  and  (ii)  holds,  then  given  any  n  we  can 
effectively  find  m  and  t,  so  that  n  =  b  a(t)  and  then  f  is 
effectively  computable.   In  particular,  if  a  =  o  (the  zero 
function   o(x)  =  0)  and  b  =  s  (the  successor  function 
s (x)  =  x+1) ,  then  (i)  and  (ii)  are  satisfied. 

Throughout  this  chapter,  s  will  denote  the  successor 
function  and  o  will  denote  the  zero  function. 

J.  Robinson  [1968]  showed  that 

Theorem  4.1.2    Every  recursive  function  of  one  variable  can 
be  obtained  from  o  and  s  by  repeated  compositions  and  general 
recursions  from  previously  defined  functions. 

Thus,  every  primitive  recursive  function  and  therefore 
every  function  in  £   can  be  so  defined.   It  is  not  immediately 
clear  that  if  f  is  in  S   (n  >^  0)  and  obtained  as  above, 
whether  all  auxiliary  functions  obtained  in  the  process  are 
also  m  £  . 

We  will  show  that  for  n  >_  3 ,  any  function  in  £   can  be 
obtained  as  in  the  theorem  in  such  a  way  that  all  functions 
obtained  on  the  way  are  in  S  .   The  proof  is  essentially  the 
same  as  that  given  in  J.  Robinson  [1968]  ,  with  verifications 
that  the  functions  concerned  are  in  £  . 
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Let  j  be  a  pairing  function.   Let  n.  be  the  Grzegorczyk 
level  of  j,  i.e.,  j  G  6   . 

Definition  4.1.3    f  is  defined  from  a,  b,  h,  g,  and  p  by 
bounded   general    recursion    if  f  is  obtained  by  general 
recursion  from  a,  b,  g,  h  and  f  (x)  <_   p(x)  for  all  x. 

Let  C  be  a  class  of  functions.    C  is  closed   under   bounded 
general    recursion    if  whenever  g,  h,  and  p  are  in  C   then  so 
is  the  function  f  as  given  above.   (Bounded  general  recursion 
is  actually  bounded  iteration  (3.2.1),  where  all  the  functions 
involved  are  of  one  variable.) 

We  have   from  3.2.11: 

Proposition  4.1.4    Any  function  in   S   (n  >^  3)  can  be  obtained 
from   s(x)  =  x+1,   U^(x,y)  =  x,   U2(x,y)  =  y,   f*(x,y),  j,  k,  £ 
by  repeatedly  constructing  a  new  function  from  previously 
obtained  functions  by  substitutions  and  bounded  iterations  of 
the  form: 

g (x, 0)  =  ax 

g(x,sy)  =  bg(x,y) 

g(x,y)  =  h(x,y) 

Let  i  be  the  identity  function  i (x)  =  x.   Suppose  that  the 

functions  j(i,o)  and  j(k,sil)  defined  by  j(i,o)(x)  =  j(i(x),o(x)) 

n-j 
and  j(k,s£)(x)  =  j  (k  (x)  ,  s£  (x)  )  are  in  &    ■' . 

The  next  result  shows  that  bounded  iteration  can  be 

replaced  by  bounded  general  recursion  of  a  special  type. 

Proposition  4.1.5    In  the  definition  of   £   (n  ^  n.),  bounded 
iteration  can  be  replaced  by  bounded  general  recursion  of  the 
form: 

f  j  (i,  o)  =  a 

f  j  (k,s£,)  =  bf 

f  (x)  <_  c  (x) 

Proof:   Let  g  be  defined  by  bounded  iteration: 

g(x,0)  =  ax 

g(x,sy)  =  bg(x,y) 

g(x,y)  <_  h(x,y)  ,      a,b,h  e  £  . 
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Then  g  can  be  defined  by  bounded  general   recursion  as  follows 
Define  f: 

f j (i,o)  =  a 

f  j  {k,sZ)    =   bf 

f (x)  <  h(kx,£x) 
Then  g (x,y)  =  f j {x,y) . 

Similarly,  given  a  function  f  defined  by  bounded  general 
recursion,  we  can  define  f  using  bounded  iteration: 
Suppose  f  is  defined  by: 

f j (i,o)  =  a 

f j (k,s£)  =  bf 

f(x)  <_  c(x)   ,     a,b,c  e  s"  . 
Let 

g(x, 0)  =  a (x) 

g(x,sy)  =  bg(x,y) 

g  (x,y)  <_  cj  (x,y) 
Then,  f{x)  =  g(kx,Jlx)  .  P 

If 

The  functions  f  (x,y)  were  defined  by  iteration.   But  we 

I 
could  have  defined  functions  f  (x)  by  general  recursion  as  m 

the  proof  of  4.1.5,  so  that  f*(x,y)  =  f'j(x,y).   Thus,  the 

I  n  n  ^ 

functions   ^n^^^  ^^"  replace  f*(x,y)  in  the  definition  of  £ 
(n  >_  n  .  )  .  . 

We  now  wish  to  consider  functions  of  one  variable  in  £  . 
This  class  will  be  denoted  S, . 

Proposition  4.1.6  (J.  Robinson  [1968])    Every  function   in 
£,  (n  >_  n.)  can  be  obtained  from  o,  s,  k,  and  i   by  repeatedly 
constructing  a  new  function  f  from  previously  obtained 
functions  a  and  b  by  composition,  pairing  (f  =  j (a,b) ) ,  or 
general  recursion  of  the  form  fj(i,o)  =  a;  fj(k,s£)  =  bf. 

Proof :    Pirst,  the  initial  function  f'  (>^)  can  be  so 
constructed.   Hence  (by  4.1.5)  every  function  in  ^i   can  be 
obtained  from  j ,  k ,  £   by  general  recursion  of  the  form  of 
the  proposition,  and  substitution.   To  obtain  functions  of 
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one  variable  by  substitution,  it  is  always  sufficient  to  use 
substitution  to  define  only  functions  of  one  variable.  The  only 
initial  function  of  more  thanone  variable  which  yields  new 
functions  of  one  variable  is  j.   Hence,  besides  compositions 
of  functions  of  one  variable,  we  must  allow   f  =  j(a,b)   to 
be  constructed  from  previously  obtained  functions   a  and  b. 
Hence  all  functions  in   £   (n  >_  n.)  of  one  variable  are 
obtained  from  o,  i,  s,  k,  and  I   by  compositions,  pairings,  and 
general   recursion  of  the  form  given  in  the  proposition.  Since 
i  -    j(k,2,),  we  can  omit  i  from  the  initial  functions.     p. 

Proposition  4.1.7   (J.  Robinson  [1968]).    Every  function  in 
£,  (n  >_  n.)  can  be  obtained  from  o  and  s  by  repeated  composi- 
tions and  general  recursions  from  previously  defined  functions 

pn 
in  £,  . 

Proof :   Let  R  be  the  class  of  functions  obtainable  from  o  and  s 
by  composition  and  general  recursion.   Then,  i,  k,  and  I   are 

in  R: 

io  =  o  is  =  si 

kj(i,o)  =  io  kj(k,s2.)  =  ik 

£j(i,o)   =o  £j(k,s£)=s2. 

R  is  closed  under  pairing  (J.  Robinson),   Furthermore,  to 
define  functions  in  £,  ,  only  function?  in  £,  are  needed. 

Hence,  every  function  in  £ -,   (n  >_  n.)   belongs  to  R, 
provided  there  is  a  pairing  function  j  such  that  j(i,o)  and 
j(k,sJ?,)  are  in  R.   One  such  j  is  given  by  j(x,y)  =  (2x  +  1)2"-1. 
For  this  j,  j(x,o)  =  2x,  'j(x,sy)  =  2j(x,y)  +  1.   Hence 
j(i,o)  is  the  double  function  d  and  j(k,s2,)  is  sd.   d  belongs 
to  R  since  it  is  defined  by  the  general  recursion  do  =  o, 
ds  =  ssd.   Hence  every   function  in  £,  (n  >^  3)  belongs  to  R.  □ 

It  is  possible  that  the  theorem  holds  for  n  >^  2 ,  provided 
that  there  is  a  quadratic  pairing  function  such  that  j(i,o) 
and  j(k,s2,)  are  easily  definable. 
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A  functional    equation    is  an  equation  of  the  form 

1      k     1      m 

An  equation  a  =  3  is  derivable    from  a  system  E  of 

functional  equation  if  there  is  a  chain  of  equations 

a,  =  . . ,  =  a   (n  >  1)   such  that  a,  =  a,  a   =  B,  and  for 
1  n    —  1     '   n 

each  t,  ot^,-.  is  obtained  from  a   by  replacing  a  part  y  of  a 
by  6,  where  y  =  "5  or  6  =  y   belongs  to  E. 

A  system   E (s , f ,u, , . . . )  of  functional  equations  defines 
a  particular  function  f„  if  f„  is  the  unique  function  f  for 
which  s  is  the  successor  function  and  there  are  functions 
u  ,u„,...   so  that  E (s, f ,u  ,U2, • . . )  holds.   Since  every 
hyperarithmetical  function   can  be  defined  by  some  system 
of  functional  equations  (J.  Robinson  [1967]),  there  is  in 
general  no  way  to  compute  the  values  of  a  function  defined 
by  a  system  of  functional  equations. 

Example  4.1.8  (J.  Robinson  [1968])    The  equations 

do  =  o  io  =  o 

ds  =  ssd  is  =  si 

define  the  zero  function  o (x)  =0,  the  double  function  d(x)  =  2x, 
and  the  identity  function  i(x)  =  x. 

Proof;   The  equation  ds  =  ssd  gives:   d(x)  =  2x  +  d(0) .   Thus, 
the  only  possible  fixed  point  for  d  is  0 .   The  equation  do  =  o 
implies  that   Range (o)  =  0.   Hence   o  is  identically  0  and 
d(x)  =  2x.   The  equation   is  =  si   gives   i (x)  =  x+  i(0). 
Since   o  is  the  zero  function,  we  have  that  i(0)  =  0. 
Therefore  i  is  the  identity  function.  g 

Theorem  4.1.9  (J.  Robinson  [1968])    A  function  f  of  one 
variable  is  recursive  if  and  only  if  there  is  a  system 
E (s ,  o,  f ,u-  ,  .  .  .  )   of  functional  equations  such  that 

(i)   E  has  a  unique  solution  in  which  s  is  the  successor 

function  and  o  is  the  zero  function.  In  this  solution, 
f  is  the  given  function. 

(ii)  For  all  natural  numbers  n  and  m,  the  equation  fs  0  =  s  0 

is  derivable  from  E  if  and  only  if  f(n)  =  m  . 
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If  (ii)  holds,  thon  we  say  that  the  values  of  f  can  be 
derived    from  E.   To  obtain  E  for  a  recursive  function  f,  start 
with  the  equations  00=0  and  os  =  o  and  adjoin   a  sequence  of 
equations  corresponding  to  successive  definitions  of  new 
functions  by  compositions   and  general  recursions  from  o,  s, 
and  functions  already  defined,  ending  with  a  definition  of  f. 
This  is   possible  by  Theorem  4.1.2.   E  then  satisfies  (i)  and 
(ii)  so  that  given  such  an  E  we  can  effectively  compute  the 
values  of  f.   It  follows  that  equations  of  the  form  fs  o  =  s  o 
are  decidable;  we  simply  systematically  derive  all  possible 

equations  from  E.   For  each  k,  we  will  eventually  derive  an 

k      n 
equation  of  the  form   fs  o  =  s  o.   Thus,  the  equation 

fs  o  =  s  o  is  derivable  if  and  only  if  m  =  n.   What  happens 

to  arbitrary  expressions  a,  .  .  .a,  =  b^  . . .b    is  not  clear. 
-^    ^  1     k     1     m 

It  will  be  shown  in  the  next  section  that  equations  true  for 
the  natural  numbers  are  not  derivable.   We  are  interested 
in  systems  E  of  functional  equations  for  the  following 
reason : 

The  functions  of  E  can  be  thought  of  as  generators  and 
the  equations  as  relations  of  a  semigroup,  since  a  set  of 
functions  with  the  operation  of  composition  is  always  a 
semigroup  (i.e.,  an  associative  system).   Derivability  from 
a  system  of  functional  equations  is  analogous  to  derivability 
from  the  relations  of  a   semigroup.    Thus,  our  goal  is  to 
construct  decidable  semigroups   by  constructing  decidable 
systems  of  functional  equations. 


4.2   Systems  of  Functional  Equations — Decidability 

Definition  4.2.1    Let  E  be  a  system  of  functional  equations 
An  expression   f^...f,  is  called  a  computation    if 
(i)   it  is  of  the  form  f,...f,_,o,  where  o  is  the  zero 

function  and  none  of   f,,...,f,_i  are  'o'; 
or, 
(ii)  it  can  be  transformed  into  an  expression  of  the  form 

g^  . . .g  o   using  the  equations  of  E. 

39 


As  stated  in  Section  4.1,  if  a  system  E  of  functional 
equations  defines  a  recursive  function  f  and  E  also  defines 
all  functions  used  in  defining  f,  then  all  equations  between 
computations  are  decidable  (all  such  equations  are  equivalent 
to  equations  of  the  form   f^...f,s  o  =  g, . . .g . s  o) .   We  will 
now  consider  other  equations,  i.e.,  noncomputations .   Here  we 
give  examples  of  very  simple  systems  of  functional  equations 
from  which  equations  true  about  the  natural  numbers  are  not 
derivable . 

Example  4.2.2.  Let  E  be  the  following  system  of  equations: 

(1)  oo  =  o 

(2)  OS  =  o 

(3)  do  =  o 

(4)  ds  =  ssd 

These  equations  define  both  the  zero  function   o    and  the 
double  function  d  (4.1.8).   We  claim  that   od  =  o  is  not 
derivable.   We  show  this  by  showing  that  o  cannot  be  obtained 
from  od   using  the  above  equations. 

The  only  way  'd'  can  be  removed  is  to  use  the  equation 
'do  =  o' .   But  there  is  no  equation  which  can  be  used  to  get 
'o'  to  the  right  of  'd'.   Thus,  ' od '  cannot  be  transformed 
into  '  o  '  .  r-, 

]^ 
However,  note  that  ods  o  =  o  is  provable  for  all  k  >^  1 

k       2k 
(ods  o  =  OS   o  =  o) ,  and  oso  =  oo  =  o .   Thus,  here  we  have  a 

situation  where  f(x)  =  0  is  provable  for  all  natural  numbers 

X,  yet  f  =  o  (the  zero  function)  is  not  provable. 

In  doing  derivations  from  systems  of  functional  equations 

E,  we  are  dealing  with  formal  systems.   Indeed,  the  natural 

numbers  W  are  a  model  of  the  equational  system  (as  defined 

in  Chapter  2)   whose  function  symbols  are  those  of  E  and 

whose  axioms  are  the  equations  of  E.   Thus,  the  equation 

od  =  o  is  true  if  we  consider  d  and  o  to  be  the  double  function 

and  the  zero  function,  respectively,  defined  on  the  natural 

numbers,  but  the  fact  that  it  is  not    provable  means  that 
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there  exist  interpretations  in  which  it  is  not  true .   The 
following  is  a  possible  interpretation  for  the  system  given 
in  the  example: 

The  domain  consists  of  three  distinct  elements  a, ja^^a^ 
The  functions  o,  d,  s  are  defined  as  follows: 


o 

d 

3 

^1 

^1 

^1 

^1 

^2 

^2 

^3 

^2 

^3 

^3 

^3 

^3 

The  equations  in  the  example  are  satisfied.   However, 

od(a„)  =  ©(a^)  =  a^   and   0(^9^  ~  ^2  '  ^•^•'  0^(32)  7^  °(^2^ 

Following  are  some  more  examples.   We  show  certain 
equations  are  not  provable  by  constructing  interpretations 
in  which  they  are  not  true. 


Example  4.2.3 


00  =  o    do  =  o    go  =  so 
OS  =  o    ds  =  ssd  gs  =  sg 


The  functions  defined  are  o,  s,  d,  and  g(x)  =  x+1 .   g  =  s  is 
not  derivable.   This  is" obvious  because  no  relations  are 

applicable t  but  it  is  something  we  might  expect  to  be  derivable, 

k       k 
especially  since   gs  o  =  ss  o  is  derivable  for  all  k.   The 

following  is  a  model  in  which  g  =  s  is  not  true. 


o  s  g  d 

^1    ^1  ^1  ^1  ^1 

^2    ^1  ^2  ^3  ^2 

a     a  a  a^  a^ 


The  equations  above  are  satisfied,  but  s(a2)  7^  g(a2) 
and  s(a^)  7^  g(a). 
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Example  4.2.4 


00  =  0 
OS  =  o 


do  =  o 
ds  =  ssd 


10  =  o 
is  =  si 


(i  is  the  identity  function).   id  =  d  is  not  derivable; 


1  o  d  s 

^1    ^1  ^1  ^1  ^1 

^2    ^3  ^1  ^2  ^1 

a    a  a^  a3  a^ 


Here  idCa^)  =  ^(^o^  ^  ^3  ^"^   d(a2)  =  a^  ,  i.e.,  id(a2)  7^  '^(^2^ 


4.3   s'^  -Decidable  Semigroups 

Let  E  be  a  system  of  functional  equations  defining  some 
recursive  function  f  and  all  the  auxiliary  functions  needed 
to  define  f.   For  such  a  system,  all  equations  of  the  form 
gs  o  =  s  o  (and  therefore  all  computations)  are  decidable. 
In  what  follows,  we  show  how  to  modify  E  so  that  it  still 
computes  the  same  functions,  but  so  that  all    equations  are 
decidable.   The  idea  is  to  modify  the  defining  equations  of 
the  given  system  so  as  to  be  able  to  distinguish  between 
computations  and  noncomputations ,  and  avoid  indefinitely  long 
derivations  involving  noncomputations,  while  not  changing 
any  equations   gs  o  =  s-^o   derivable  from  the  original 
system,  i.e.,  to  change  the  system  in  such  a  way  that  M    is 
still  a  model . 

We  remark  that  the  method  used  in  the  previous  section 
to  show  that  certain  equations  are  not  derivable  is  not 
really  practical:   while  all  models  constructed  there  were 
simple,  in  general,  there  is  no  a  priori  bound  on  the 
cardinality  of  the  model  we  need,  even  more,  there  is  no 
guarantee  that  the  model  is  even  finite. 
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Let  E  be  a  system  of  functional  equations  defining  the 
recursive  function  f  and  all  the  functions  needed  to  define 
f.   Further,  suppose  E  includes  the  following  equations: 

(1)  io  =  o   I       definition  of  the 

(2)  is  =  si  J         identity  function   i (x)  =  x 

(3)  of  =  o         for  all  functions  f   in  E 

n  n 

(4)  if  =  f  i        for  all  functions  f   in  E 

n    n  n 

Let  E'  be  the  system  with  the  same  function  symbols  as 

E  and  the  following  equations:   For  all  equations  a  =  6  in  E 

(a  =  B  is  a,  .  .  .a   =  b^  .  .  .b  ) 
1     m     1     n 

(i)  if  a  =  6  does  not  involve  o  and  n  <  m,  then 

,     ,   . m-n  ^  „ , 
a,  .  .  .a   =  b,  .  .  .b  1     G  E '. 
1    m     1     n 

(ii)  If  a  =  3  does  not  involve  o  and  m  <  n,  then 

.  n-m    ,     i_   ^  T-.  I 
a,  , . .a  1     =  b,  .  .  .b  G  E  '  . 
1    m        In 

(iii)  Otherwise,  a  =  B  e  e' . 

By  the  remarks  at  the  end  of  Section  4.1,  we  can  associate 
with  E  a  semigroup.   We  will  interchangeably  refer  to  E  as  a 
system  of  functional  equations  or  as  a  semigroup.   Also,  the 
terms  "equations"  and  "relations"  will  be  used  interchangeably. 

Proposition  4.3.1    e] —  fso  =  so**E'  | —  fso  =  s-^o. 

Proof:  (=*)   Suppose  E  | —  fs  o  =  s-'o. 

The  relations  of  E'  differ  from  those  of  E  in  the  number  of 
i's  present.  Thus,  to  obtain  a  proof  of  fs  o  =  s-'o  from  E', 
first  use  (1)  to  generate  enough  i's,  then  use  the  same  proof 
as  from  E,  with  the  corresponding  relations  in  E',  using  (4) 
to  move  the  i's  into  position  and  at  the  end,  using  (1)  to 
remove  excess  i's. 

(<=)   Suppose  E'  I —  fs  o  =  s-'o. 
To  obtain  a  proof  from  E,  use  the  relations  in  E  corresponding 
to  those  in  the  proof  from  E',  using  (1),  (4),  and  (3)  to 
generate,  move,  and  remove  i's.  q 
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Note  that  the  only  difference  between  a  proof  from  E 

and  a  proof  from  E'  of  fs"'"o  =  s-'o  is  in  the  number  of  i's 

present.   W  is  a  model  of  E'  with  i  interpreted  as  the 

identity  function.   We  remark  that   if   =  f   ,  while  true 

n     n 

for  /^  is  probably'  not  provable. 

Proposition  4.3.2    Let  a   a_  . . .  a   be  any  expression  on  the 
function  symbols  of  E ' .  Then  it  is  decidable  whether  or  not 
a^a„...a   is  a  computation. 

Proof:   By  definition,  a,... a   is  a  computation  if  and  only 

if  it  is  equal  to  an  expression  of  the  form  b, . . .b  o  under 

i    m 

the  relations  of  E'.   We  consider  several  cases: 

(i)   a, . . .a   contains  o. 
1     n 

Then  either  the  spelling  of  a,  . . .a   is  a,  ...a   ,0 

in      1     n-i 

(where  none  of  a, ...a   ,  are  o)   or  a,.., a   is 

1     n-1  1     n 

identically  equal  to   a,... a,  ,  oa,  ,.,... a   , 

1     k-1   k+1     n 

where  none  of   a,,...,  a,  ,  are  o  (some  of  a,  ,,,...,  a 

1      k-1  k+1      n 

could  be  o) .  Then,  by  using  relations  (3), 

a-]. -.a   =  a,...a,_,o.    Thus,   a,  .  .  .a   is  a  computation. 

(ii)  a, . . .a   does  not  contain  o. 
1     n 

First  note  that  if  a  =  g  is  a  relation  in  E'  not  involv- 
ing o,  then  length(a)  =  length(6).   Let  k  be  the  number 

of  function  symbols  in  E*.   To  dedide  if  a, , . .a   is  a 

In 

computation,  apply  the  relations  of  E'  to  a^  . . .a  . 
Since  the  left-hand  side  and  the  right-hand  side  of 
relations  not  involving   o   have  the  same  length,  we 
have   a  bound  on  the  maximum  number  of  steps  (i.e.,  the 
number  of  relations  used)  needed  to  reach  an  expression 
involving  o:  k    (this  is  the  number  of  distinct  words 


of  length  n  over  an  alphabet  of  k  letters) .  If  o  does 

not  appear  within  this  number  of  steps,  then  a,  . . .a 

"^  1     n 

is  not  a  computation.  -. 
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Proposition  4.3.3    E'  is  decidable,  i.e.,  there  is  an 
effective  procedure  to  decide  for  any  two  words  a, 3  whether 

E'  |—  a  =  6. 

Proof :   The  following  is  a  procedure  to  decide  whether 
E'  ) —  a  =  3:   First  check  to  see  if  both  a  and  3  are  computa- 
tions . 
(i)   If  so,  then  both  a  and  3  are  equal  to  expressions  of 

the  form   b,  ...b,  o.   Since  E'  defines  all  the  functions 

1     k 

b,,...,b,  and  all  the  functions  are  recursive,  the  values 

of  bw...,b   are  computable  from  E'.   Thus,  a  can  be 

transformed  into  an  expression  s   o;  3  can  be  transformed 

kp 
into  an  expression  s   o  ,  and  so,  E'  \ —  a  =  3  **  k,  =  k„. 

(ii)  If  not,  then  at  least  one  of  a  and  3  is  not  a  computation, 
say  a.   Then  we  can  generate  a  list  of  everything  deriv- 
able from  a  (this  is  finite  and  can  be  obtained  in  a 
finite  number  of  steps)  ,  and  E'  | —  a  =  3  **  3   appears 
on  this  list. 
It  is  clear  that  if  E'  | —  a  =  3,  then  this  procedure  will  verify 
that,  and  if  E'  {-/■  a    =    ^    ,      this  will  be  shown  in  a  finite 
number  of  steps.  p, 

The  difficulty  of  deciding  whether  E'  \ —  a  =  3  lies  in 
the  difficulty  of  computing  the  functions  that  E'  defines. 
It  is  not  difficult  to  decide  whether   a^  ' ' '^v    ^^   ^   computation. 
Furthermore,  the  fact  that  a  =  3  is  decidable  when  a  and  3  are 
both  computations   depends  on  the  fact  that  E'  defines  recursive 
functions.   This  suggests  that  if  we  had  a  bound  on  the 
computational  complexity  of  the  functions  defined  by  E '  we 
would  have  a  bound  on  the  degree  of  difficulty  of  E'. 

Proposition  4.3.4    For  n  >^  4 ,  there  exist  strictly  £  -decidable 
semigroups . 

Proof:   For  n  >  4,  let  S   be  a  set  in  s"  -  s"^"-*"  (cf.  3.1.9). 
—         n   _ 

By  (3.1.6  (vi) ) ,   S    and   S   (the  complement  of  S  )  can  be 

n        '^    _      n         ^  - 

enumerated  by  functions   f    and  f   in  £  .   By  (4.1.7)  f   and  f 
^  n       n  -^  n      n 
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can  be  defined  from  functions  of  one  variable  in  £  .   Let  E 
be  the  system  of  functional  equations  defining  f   and  f 


together  with  the  equations : 

gf   =  o 
^  n 

gf   =  sof 
^  n      n 

g  is  a  0-1  valued  function  defined  by  general  recursion. 

Further,  gs  o  =  o  if  and  only  if  k  e  range (f  ) ,  and 

k  G  range (f  )  if  and  only  if  k  G  s   ,  i.e.,  g  is  the  character- 
^    n  ■'  n  ^ 

istic  function  of  S   and  therefore   g  G  £  .   Thus,  the  word 

n  ^^ 

problem  for  E   cannot  be  lower  than  £  ,  since  a  decision 

procedure  for  E   could  be  used  for  computing  S  . 

We  now  show  that  E   is  decidable  at  level  6  .   Suppose 

n 

we  wish  to  decide  if  w^  ~  "9 •   By  4 . 3 . 2  and  4.3.3,  the 
number  of  steps  (i.e.,  the  number  of  relations  that  need  to 
be  used)  to  decide  if  w,  and  w_  are  computations  and  whether 
w,  =  w     when  either  w,  or  w   (or  both)  are  not  computations 
is  an   £   function  of   max{ length (w, ), length (w„ )} . 
Suppose  w,  and  w„  are  both  computations : 


"1 

-  gi-. 

••V 

^2 

=  h3_.. 

.  .h   0 

m 

Let  p(i)  be  the  maximum  number  of  steps  needed  to  compute 

fs^o  for  any  f  e  e    (p  e  a"^)  . 

n 

Let  q(i)  be  the  maximum  number  of  steps  needed  to  compute 
fs  o  for  any  f  e  E   ,  f  t^  g. 

We  can  assume  that  p  and  q  are  strictly  increasing.  Since 

3        3 
all  the  functions  of  E   except  g  are  in  £  ,  q  G  £  ,  and  for 

all  functions  f  in  E   , 

n 

f  (k)  <  q(k)   for  all  k 

(easily  seen  to  be  true  for  all  functions  f  not  equal  to  g; 
it  is  true  for  g  since  g  is  0-1  valued) . 

Thus,  if  f  is  an  arbitrary  function  in   £  ,  to  compute 
fs''"o  requires  <_   p(i)  steps  and  the  length  of  the  result  is 
<_   q(i).   Using  this  fact  k  times,  the  maximum  number  of  steps 
needed  to  compute  g,  ...g,o  is 
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p{0)  +  pq(0)  +  pqq(O)  +  •••  +  pq'^  "^(0)  <_   kpq^  ^(0)  . 

Therefore,  to  decide  "w,  =  w_"  when  w^  and  w_  are  computa- 

k— 1  i-  z 

tions,  we  need  <  kpq    (0)  steps,  where  k  =  max{length (w, ) , 

length  (w_)},  and  this  is  in  £   (n  >^  4)  whenever  p  and  q  are. 

D 

For  the  purpose  of  constructing  &  -decidable  groups  we 

need  to  modify  the  semigroups  E  .   Let  E*  be  the  following 

semigroup: 

generators:   f  ^  ,  .  .  .  ,  f,  ,q  ,q,  ,h 


relations 


where  f,  ,...,f,  are  the  function  symbols  in  E 
Ik  -^  n 


(1)  q,R.   =  q,R.    for  all  relations  R.  =  R.   of  E 
^1  1^    ^1  i2  ii    i2      n 


(2)  q,f.  -    f.qi     for  i  =  1,2,.. .,k 

1  1     1  1 

(3)  hq^oh  =  q 

(the  h's  are  "end  markers"  and  q,  marks  our  place  in  the  course 
of  a  derivation) . 

Proposition  4.3.5    E  \ —  hq,g,...goh  =  q**E   I —  a    .  .  .g   o   =    o 
^ n  '    ^1^1    ^m  n  '   ^1    ^m 

Proof:  (*=)   Suppose  E   | —  g^  .  .  .g  o  =  o. 

Then  we   can  get   a  proof  of  hq,g,...g  oh  =  hq,oh  from  E   by 

using  the  relations  of  E   corresponding  to  those  of  E   used, 
^  n       '^      ^  n 

and  using  (2)  to  move  q^  into  position.  Thus, 

E   ] —  hq, g,  . . .g  oh  =  hq, oh  and  using  (3)  ,  E   | —  hq, oh  =  q  . 

(=*)   Suppose   E  \ —  hq,g,  ...g  oh  =  q . 

Then  E   | —  hq,g,  ...g  oh  =  hq,oh.   There  exists  a  sequence 

a,  ^a_  -»■  ...  ->a.  where  a,  is  hq,  g,  .  .  .g  oh  and  a  .  is  hq,  oh. 
12  j         1      ^1^1    ^m        j       i 

By  removing  h's  and  q,  from  the  a's  above  and  deleting 
duplicate  words  we  have  a  proof  from  E  .  ^ 
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Proposition  4.3.6    If  w  is  an  arbitrary  word  in  E   and  for 

all  u  in  E   ,  u  =  o  is  &  -decidable  (relative  to  E  ) ,  then 
n  n 

w  =  q  is     fi  -decidable. 

Proof:   Suppose  w  is  not  "q".   Since  h  and  q   are  not  intro- 
duced (except  by  (3)),  if  w  is  not  of  the  form  hAq  Bh 
(A  and  B  are  words  on  f.),  then  w  7^  q .   Therefore,  suppose  w 
is  of  this  form.    By  4.3.5,     hAq^Bh  (i.e.  w)  =  q  *»  AB  =  o 

in  E   ,  which  is   S  -decidable.  n 

n  '-' 
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CHAPTER  5 
HNN  EXTENSIONS 

The  Higman,  Neumann,  Neumann  (HNN)  extension,  introduced 
by  G.  Higman,  B.  H.  Neumann,  and  H.  Neumann  in  1949  is  one 
of  the  basic  constructions  of  group  theory.   It  is  used  by 
Miller  [1971]   to  prove  various  unsolvability  results  about 
groups,  and  a  topological  interpretation  is  given  in  Schupp 
[1973].   We  are  interested  in  HNN  extensions  because  they 
provide  a  way  of  constructing  a  group  G  from  a  group  G'  in 
such  a  way  that  the  word  problem  for  G  is  reducible  to  the 
word  problem  for  G'.   In  this  chapter  we  examine  the  difficulty 
of  the  word  problem  for  HNN  extensions  in  general.   In  the 
next  chapter,  we  will  exhibit  a  sequence  of  groups  G   ,  each 

of  which  is  a  series  of  HNN  extensions,  and  has  word  problem 

pn 
in  £  . 

The  following  notational  conventions  will  be  used: 

(g  ,...,g   I  R   =  1,...,  R   =1)   denotes  the  group  presented 

by  generators   g^  , .  .  .  ,g   ,  subject  to  the  defining  relations 

1       n 

R,  =1,  R„=1,...,R  =1.   Generally,  lower  case  letters 
12m  ■' 

will  be  used  for  generators  of  a  group  while  upper  case 
letters  will  be  used  for  groups  or  for  words  in  some  specified 
group.   If  Q  is  the  set  of  words  in  some  group,  <Q>  will 
denote  the  subgroup  generated  by  Q .   If  U  and  V  are  words  of 
a  group  G,  then  U  =  V  means  that  U  and  V  are   identical  as 
words;  U  =  V  means  that  U  and  V  are  equal  in  the  group  G. 
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5 . 1   Definitions  and  Main  Results 

Definition  5.1.1  (Rotman  [1973])    Let  G  =  (SJD)  be  a  group 

* 
presentation.   A"  second  presentation  G   has  basis   G  and 

stable    letters       {p   I  v  e  v]  if 

G*  =  (S,p^,  V  e  V  I  D,  P~"^A  p^   =  B  ,  j  e  J) 

where  A.  and  B.  are  words  on  S. 

For  each  v  €  v  let  J (v)  denote  the  following  subset  of 
the  index  set  J: 

J(v)  =  {j  s  J|p   is  involved  in  new  relation  j} 

={iGJ  |v=v.  and  p   A.p   =  B.}  . 

Let  A,  B,  A(v),  and  B(v)  be  the  following  subgroups 
of  G: 

A  =  <A.  I  j  e  J> 

B  =  <B .  I  j  e  j> 

A(v)  =  <A.  I  j  G  J  (v)  > 

B(v)  =  <B.  I  j  e  J(v)  > 

Definition  5.1.2  (Rotman  [1973])    The  isomorphism    condition 
is  that  for  each  v  e  V,  there  is  an  isomorphism  <^    :    A(v)->-  B(v] 
under  which   4)  (A.)  =  B.   for  all  j  e  J  (v)  . 

Definition  5.1.3  (Rotman  [1973])    A  group  G*  is  an  HNN 
extension   of  G  with  stable  letters  {p   I  v  e  V}  if 

(i)   G*  has  basis  G  and  stable  letters   {p  |v  e  v} 
(so  presentations  of  G   and  G  are  given) ,   and 

(ii)  the  isomorphism  condition  holds. 

Thus,  an  HNN  extension  is  formed  by  adding  inner  auto- 
morphisms. Another  definition,  given  for  one  stable  letter 
t  is  : 
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Definition  5.1.4    Let  G  be  a  group,  and  A  and  B  be  subgroups 
of  G  with  c[) :  A  ->■  B   an  isomorphism.   The  HNN  extension  of  G 
relative  to  A,  B,  and   4)   is  the  group 

G*  =  <G,  t  I  t~  a  t  =  $ (a) ,  a  e  A> 

The  importance  of  HNN  extensions  lies  in  the  following: 

Theorem  5.1.5  (Brit  ton's  Lemma)    Let  G   be  a  Britton  exten- 
sion of   G  =  (S  I  D)  with  stable  letters  {p   I  v  e  V} . 

If  w  is  a  word  involving  at  least  one  stable  letter  and  if 

* 
w  =  1  in  G  ,  then  w  contains  a  subword  of  the  form 

— e   e 
p   Cp   where  e  =  +  1   and  C  is  a  word  on  S. 
^v      ^v  — 

Moreover,  if  e  =  +1,  then  C  is  equal  in  G  to  a  word  on 
{A.  I  j  G  J(v)}   (i.e.,  C  is  equal  in  G  to  an  element  of 
A(v)  C  A) ;   if  e  =  -1,  then  C  is  equal  in  G  to  a  word  on 
{B.  I  j  G  J(v)}   (i.e.,  C  is  equal  in  G  to  an  element 
of  B(v)  c  B)  . 

The  combinatorial  significance  of  stating  "C  is  equal 
in  G  . . . "   is  that  C  can  be  transformed  into  a  word  in  A(v) 
or  B(v)  without  inserting  P^P^   or  p   p  . 

Definition  5.1.6  (Rotman  [1973])    A  word  of  the  form  P^Cp^ 
is  called  a  pinch    if  e  =  +  1  and  the  "moreover"  paragraph 
above  holds  for  C. 


Remark.   Instead  of  "w  =  1  in  G*  "   in  the  above  theorem 


we  could  have   "w  =  T  in  G* , "   where  T  is  any  word  in  G 
and  the  theorem  would  still  hold. 

Definition  5.1.7  (Rotman  [1973])  Given  G*  with  basis  G 
and  stable  letters  P  =  {p  |  v  g  v)  then  a  word  w  is  p  - 
reduoed    (for  some  fixed  p  )  if  w  contains  no   pinch 


V 


p  Cp    as  a  subword.   We  say  that  w  is  ^p-reduoed   if  w  is 

p  -reduced  for  all  p  g  p,  p[w]  will  denote  the  p-reduced  w, 
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It  is  easy  to  see  that  an  arbitrary  word  w  is  equal 
in  G   to  a  p-reduced  word:  the  relations  of  G   allow  us  to 
replace  each  pinch  by  a  word  involving  two  fewer  occurrences 
of  a  stable  letter.   Thus,  if  w  =  1,  then  w  can  be  p-reduced 
successively  to  obtain  a  word  on  S  which  equals  1.   If  w 
cannot  be  p-reduced,  then  w  t^  1  in  G  .   This  is  a  procedure 
(not  necessarily  effective)  to  solve  the  word  problem  for  G 
To  state  this  more  formally,  we  introduce  the  following 
notation : 

(?  3)  T{6)   denotes  the  problem  of  deciding  for 

an  arbitrary  word  6,  whether  T{B)  is  true 
Let  A(v)   be  a  variable  ranging  over  words  in  A(v). 
Let  B(v)   be  a  variable  ranging  over  words  in  B(v) . 

* 
Proposition  5.1.8    The  problem  of  obtaining  p[w]  for  w  g  G 

is  reducible  to 

(i)   (?u  e  G)  (3a(v))  (u  =  A(u))   in  G 

(ii)  (?u  e  G)  (3B(v)  )  (u  =  B{v)  )   in  G  . 


Proof:   By  definition  of  p-reduction 


D 


Proposition  5.1.9    Let  T  be  a  variable  ranging  over  some 
particular  set  of  words  of  G.   Then  ( ?w  e  G*)  (3T)w  =  T  in  G' 
is  reducible  to 

(1)  (?u  e  G)  3A(v)u  =  A(v)   in  G 

(?u  e  G)  3B{v)u  =  B(v)   in  G 

(2)  (?u  e  G)  (3T)u  =  T       in  G  . 

Proof:   Since  any  word  is  equal  in  G   to  a  p-reduced  word, 
we  have  {3T)  (w  =  T  in  G*)  ^    (3T)  (p[w]  =  T  in  G*)  . 
We  will  now  show  that 

(3T)  (p[w]  =  T  in  G*)  *»  p[w]  is  p-free  and 

(3T)  (p[w]  =  T  in  G)  . 

The  implication  (•^)  is     clear.  To  see  (=»■),  suppose 
p[w]  is  not  p-free,  but  (3T)(p[w]  =  T  in  G*).   Then, 
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-1  * 

p[w]T    =  1  in  G   and  Britton's  lemma  applies.   Thus, 

—  1  e   ~e 

p[w]T    contains  a  subword  p  Cp    and  C  is  of  the  required 

form.   But  p  Cp   must  be  entirely  contained  in  p[w]  since 

T  is  p-free,  contradicting  the  fact  that  p[w]  is  p-reduced 

Therefore, 

* 

(3t)  (p[w]  =  T  in  G  )  =*  p[w]  is  p-free  and 


(3T)  (p[w]  =  T  in  G) 


D' 


* 


Corollary  5.1.10   w  =  1  in  G   is  reducible  to 

(1)  (?u  e  G)  (3A(v))  (u  =  A(v)  in  G) 

(?u  e  G)(3B(v))(u  =  B(v)  in  G) 

(2)  (?wGG)w=l   inG. 

Proof:   Take  1  as  the  range  of  T  in  5.1.9. 

D 

Proposition  5.1.11  (Rotman  [1973])    Let  G*  be  a  Britton 

extension  of  G  with  stable  letters  P  =  {p  |v  G  V}. 

^^  em  V  f,       f^ 

Assume   U  =  R^p   R, . . , p   R    and  W   =  L„p  L. • • .p„  L    are 
C^v   1    ^v  m  O'^v   1     V  n 

P-reduced  words,  where  each  e,  f  =  +  1   and  none  of  the 

(possibly  empty)  words  on  R  or  L  involve   p  .   If  U  =  W 

in  G  ,   then  m  =  n  and  (e,,...,e  )  =  (f ,,..., f  ). 

e    -'■_i  _f  ni      1       n 

Moreover,  the  word   p   R  L   p   "   is  a  pinch. 

'^v  m  n  '^v         "^ 

5.2   S  -Decidability  of  HNN  Extensions 

We  will  now  discuss  the  S   solvability  of  the  word 
* 
problem  for  G  .   Suppose  (for  the  rest  of  this  chapter) 

that  G*  is  finitely  presented.    Then  there  are  only  a  finite 

number  of  relations   p  A.p    =  B.   so  that  if  we  have  a 

'^v  .  I'^V  .      1 

-1*3^       * 
word  of  the  form  p   A  p    where   A   =  word  on  A (v.)  and 

•^v .   "^v .  1 

D     3 

we  p   -reduce,  then  there  is  a  constant  c  such  that 
"^v  . 

■^   *  -1  *        *  * 

length  (B  )  (where   p   A  p    =  B  )  is  <^  c  length  (A  )  . 

Vj     Vj 
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Proposition  5.2.1    Let 


(?U  G  G)  (3A(v))  (U  =  A(v)  in  G) 
(?U 


G)  (3B(v)  )  (U  =  B(v)  in  G] 
to  obtain  p[w]  is  in  S 


(*) 


be  decidable  at  level  £  '.   Then  the  number  of  steps  needed 

,n+l 


Proof :   Assume  that  if  U  =  A(v)  in  G,  then  we  can  explicitly 

s  of  the  elements  c 
Assume  also  that  f 


obtain  a  representation  U   of  U  in  terms  of  the  elements  of 


A(v)  in  <_   f(length(U))  steps,  f  e  £  , 
is  an  increasing  function. 

In  order  to  p-reduce  once,  <_   f  (length  (U))  steps  are 
required,  the  length  of  U   is  <_   f  (length  (U))   and  by  the 
above  remark,  the  length  of  the  new  word  is  <_  cf  (length  (U)  )  . 
p-reducing  takes  the  most  number  of  steps  when  U  is  of  the 

,  for  then  the  length 


.      -1„   -1„   -1 
form   p   U,p   U„p 

^1  1^2  ^"^^3 


•  p   U„p   U,p 
^3  ^    ""2    1^1 


of  the  word  to  which  the  procedure  for  (*)  is  applied  depends 
on  the  result  of  the  previous  application,  i.e.,  to  p-reduce 
U  twice  can  require 


<  f(length(U))  +  f (cf (length (U) ) )  <  2f (cf (length (U) ) )  steps 


to  reduce 
once 


to  reduce  a 
second  time 


and  the  length  of  the  result  is  <_   cf  (cf  (length  (U)  ))  .   Thus, 
to  p-reduce  m  times,  we  need 


m 


<  m(cf)  (length(U))  =  g(m,  length(U))  steps 


and  g  G  S 


n+1 


(3.2.2) 


The  maximum  number  of  times  a  word  U  can  be  p-reduced  is 

<_  length  (U)  .  Thus,  to  p-reduce  U  requires 

<  g(length(U),  length (U))  steps,  which  is  in  S 


D 
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There  is  an  analogy  between  p-reducing  words  and 
functions  defined  by  iteration  from  functions  in  6  . 
Let  h(x,y)  be  the  following  function: 

h(x,0)   =   f(x) 

h(x,y+l)  =   gh(x,y) 
i.e., 

h(x,y)   =   g^CfCx))  . 

If  f  and  g  are  in  S  ,  then  h  is  in  S     (cf .  3.2.2)  . 
The  complexity  of  h  here  depends  on  the  size  of  g.   If  g 
is  nondecreasing ,  then  g  is  being  applied  to  larger  and 
larger  arguments,  causing  h  to  grow  very   fast. 

As  seen  in  the  proof  of  5.2.1,  the  procedure  for 
p-reducing  might  have  to  be  applied  iteratively  to  larger 
and  larger  words ,  each  resulting  from  a  previous  application 

of  the  procedure.   This  causes  the  decision  procedure  to 

,    .   „n+l 
be  in  £ 

Suppose  in  the  above  definition  of  h  we  have  a  bound  on 

g,  g(x)  <_   e(x)  ,  e  g  S    ,  e  nondecreasing.  Then 

h(x,y)  =  g^f  (x)  <^  g^'-'-ef  (x)  ^  e^f  (x)  e  g^  (3.2.2) 

i.e.,  h(x,y)  is  bounded  by  a  function  in  S  .   Therefore, 
h(x,y)  e  g^. 

Similarly,  we  will  show  that  if  after  p-reducing  a  word 
w  once,  the  length  of  the  resulting  word   is  <_  f(length(w)) 
and  f  e  S    ,  then  the  procedure  for  p-reducing  is  in  £  . 

Corollary  5.2.2    If  the  length  of  U   in  5.2.1  is 

<_  h  (length  (U))   where  h  is  a  function  in  6    ,  then  the 

number  of  steps  needed  to  obtain  p[w]  is  in  £  . 

Proof :   Using  a  proof  similar  to  the  one  above,  we  can  see 

that  the  number  of  steps  needed  to  p-reduce  w  m  times 

is  ^  mfh    (length(w))   which  is  in  £   (3.2.2).  g 
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Corollary  5.2.3    If  the  relations  involving  the  stable  letters 

*  -1 

p   of  G   are  of  the  form   p   Ap    =  A,  then  the  number  of  steps 
'^v  V  .   V  . 

needed  to  obtain  p[w]  'is     -'    -^   in  S  . 

Proof:  If  w  =  U'p~-'"U"p  U'"'  and  U"  e  A(v),  then  we  can  apply 
the  procedure  for  ^(*)  ^  in  5 . 2 . 1  to  the  word  U'U"U'''  (i.e., 
the  length  of  the  word  does  not  increase,  h(x)  =  x  in  5.2.2.  O 

Theorem  5.2.4    If  G  has  word  problem  in  s"  and  the  assumptions 
of  (5.2.2)  or  (5.2.3)  hold,  then  G*  has  word  problem  in  S  . 

Proof:   Suppose  that  to  p-reduce  U  e  G   requires  ;<  f  ( length  (U)) 

5teps,  f  s  g  ,  nand  to  decide  if  w  =  1  in  G  requires 

<_  g  ( length  (w))  steps,   g  e  g"^.    Then  to  solve  the  word 

problem  for  G*  requires,  for  U  e  G*  ^  f (length (U) ) +gf ( length (U) ; 

steps  (by  5.1.10).  q 

* 
Remark  5.2.5    In  order  to  p-reduce  a  word  in  G  ,  we  have  to 

solve  a  "subgroup  problem".   This  may  be  very   difficult.  We 

show  in  the   next  chapter  that  there  exist  groups  G  and  G* 

such  that  G  has  easily  solvable  word  problem,  but  G*  has  more 

difficult  or  even  unsolvable  word  problem  (i.e.,  G  has  a 

more  difficult  or  unsolvable  subgroup  problem) . 

Proposition  5.2.6    Let  G  =  (g,  t | t~  g  t  =  g~  ) .   Then  G  has 

2 
word  problem  in  £  . 

Proof :   G  is  an  HNN  extension  of  a  free  group  on  one  generator. 

But  G  is  also  the  semidirect   product  of  two  free  groups. 

2 
By  3.3.5   G  has  word  problem  in  S  .  LJ 

-1       2 

Proposition  5.2.7    Let  G  =  (g,t|t   gt  =  g  ) .   Then  G  has  word 

3 
problem  in  £  . 

Proof :   G   is  an  HNN  extension  of  a  free  group  with  stable 
letter  t. 

Let  w  be  an  arbitrary  word  in  G.   To  t-reduce  w,  we 
need  to  decide  the  following  two  questions  (by  5.1.8): 


56 


(i)  (?u  e  <g>)  (3v  e  <g>)u  =  v   in  <g> 

2 
(ii)  (?u  G  <g>)  (3v  e  <g  > )  u=  V   in  <g> 

(i)  is  trivial;  it  is  always  true;  and  we  can  obtain 

a  freely  reduced  representation   of  u  by  scanning  from  left 
to  right  and  reducing  gg    =1  and  g   g  =  1.   The  number  of 

relations  used  is  <_   length(u).   By  3.3.2,  the  procedure  is  in 

2 

S  .   To  decide  (ii),  the  procedure  is  similar  to  (i); 

just  scan  the  word  from  left  to  right,  replacing   gg    and  g   g 
by  1.   The  number  of  relations  used  is  <_   length  (U)  .   Then 
just  count  and  check  whether  or  not  the  number  of  g's  or  g   's 

in  the  remaining  word  is  a  multiple  of  two, 

2 
Thus,  the  procedure  for  (i)  and  (ii)  is  in  £  .   By  5.2.1, 

3 
the  procedure  to  t-reduce  w  is  in  S  .   By  the  proof  of  5.2.4, 

3 
and  by  3.3.3,   G  has  word  problem  m  £  .  p. 

Remark  5.2.8    By  the  same  proof  as  above,  we  can  show  that 

3 
the  following  group  has  word  problem  in  £  : 

I   -1        2 

G  =  (g,t-,,...,t   I  t.  gt.  =  g   for  i  =  l,...,n) 
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CHAPTER  6 


S'^-DECIDABLE  GROUPS 


The  word  problem  was  proved  to  be  recursively  unsolvable 
by  constructing  a  group  which  modeled   derivations  in  a  semi- 
group obtained  from  a  Turing  machine  computing  a  nonrecursive 
function.   We  use  the  same  scheme  for  obtaining  a  presentation 
of  a  group,  given  the  presentation  of  a  semigroup,  and  we 
show  that  if  the  semigroup  is  such  that  w  =  q  (w  is  an  arbitrary 
word  on  the  generators  of  the  semigroup,  q  is  a  fixed  generator) 
is  S   -  £     decidable  (n  >_  4)  ,  then  the  group  is  £   -  £ 
decidable.   The  techniques  used  are  similar  to  those  in  Boone 
[1964]  .   The  group  is  a  series  of  HNN  extensions 
G   c  G,  £  ^2  £  G^  c  G.  =  G;   we  show  that  problems  in  G.  are 
reducible  to  problems  in  G._,  ,  and  furthermore,  if  the  problems 
in  G._,  are  in  £  ,  then  the  problems  for  G.  are  in  £  .  The 
construction  of  the  groups  and  an  indication  of  the  proof  of 
£  -decidability  are  given  in  6.1,  Section  6.2   contains  the 
proofs,  and  6.3  contains  some  conclusions. 
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n 


6.1   Construction  of  £  -Decidable  Groups  (n  >  4) 


For  n  >  4,  let  S'  be  the  semigroup  given  at  the  end  of 
Section  4.3.   By  relabeling  the  generators  of  S  so  that  they 
are   q ,q, , f  ,  . . .  ,  f „  (i.e.,  reindexing  the  f's  and  letting 
h  be  some  fv^)  »  we  have  a  semigroup  S  with  presentation: 


S  =  (q,q^,f^,  •  •  •  /fj^ 


F.q.  G.  =  H.q.  K.,  i  e  I) 
1^1,  1     ^^2  ■"" 


where  F. ,  G. ,  H. ,  K.  are  (possibly  empty)  words  on 


,M},  q.  ,q.   G  {q^q-i}   and  S   is  such  that 

"""1   ""'2 
for  an  arbitrary  word  w   on   the  generators  of  S,   w  =  q  is 


C  =  ^f^lb  -  1, 


,n 


£  -decidable 


Remark  6.1.1    We  note  that  by  using  the  method  of  constructing 
a  new  semigroup  S_  from  a  given   semigroup  S,  given  at  the 
end  of  Section  4.3,  any  semigroup  can  be  made  to  fit  into 
this  scheme  (o  in  relation  (3)  would  be  replaced  by  some  word 
u  on  the  generators  of  S, ) .   We  would  then  have  w  =  u  in  S, 
■»  hq  wh  =  q  in  S_. 

We  now  present  the  scheme  for  constructing  groups  which 
will  be  seen  to  have  word  problem  in  £   (n  >_  4)  .  For  each 
choice  of  a  semigroup  S  as  above,  G  is  the  following  group: 


generators 
relations : 


q/qj^'f^/  •  •  •  ^ff4^r^   (i  G  I),   x,t,k 

(for  all  i  G  I   and   b  =  1,...,M) 

.-1.2 
f,    X  f,   =  X 
b      b 

-1.         .   -1 
r  .  f ,  X  r  .  =  f,  X 
1   b    1     b 

rT-'-F.q.  G.  r .  =  H.q.  K. 

t   r.t=r.  ,    t   xt=x 
1      1 

k   r.k=r.  ,    k   xk=x 
1      1 

,  -1  -1^  1     -I4- 
k   q   tqk  =  q   tq 
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e.     e 

The  following  notation  will  be  used:   If  X  =  f,  ...f 
^  1      m 

where    e.  =  ±1  (i.e.,  X  is  a  word  on  C) ,  then 

—       ^1        m 

X  =  f-,   ...f    .   If  X  and  Y  are  words  on  C,  then 
1     _  m  _ 

(Xq.Y)*  ^   Xq.Y.  Z   is  special    if   E  e  Xq.Y  where  X  and  Y  are 
positive  words  (i.e.,  no  negative  exponents)  on  C  and 

qj  e  {q/qj_}- 

Lemma  6.1.2  (Boone)    If  I  is  a  special  word,  then 
kE~''"tI  =  E"-*-  t  Z  k   in  G  if  and  only  if  E   =  q  in  S . 

Corollary  6.1.3    G  cannot  have  word  problem  solvable  at 
level  <  £  . 

Proof:   If  the  word  problem  for  G  were  solvable  at  level  <  £  , 
then  we  would  have  a  procedure  in  level  <   S   to   determine 
for  an  arbitrary  special  word  E,  whether  E  =  q  in  S .   By 
4.3.4  and  4.3.6  this  cannot  happen.  j-. 

Define  the  following  groups: 

G^   =   <x>,     the  free  group  on  x 

G,   =   (G„,f,  ,  b  =  1, ... ,m| relations  A^ ) 
1        U   b  i 

G_   =   (G,,r.,  i  e  I,  q,q  [relations  A  ) 
G^   =   (G„,  t  I  relations  A  ) 
G    =   ^'^B''^  I  relations  A  ) 

Theorem  6.1.4  (Rotman  [1973]) 
(i)    G,   is  an  HNN  extension  of  G„  with  stable  letters 

{fj^,  b  =  1,  .  .  .,M} 
(ii)   G,  *  <q,q-,>  is  an  HNN  extension  of  G,  with  stable  letters 

{q^q^^} 

(iii)  G„  is  an  HNN  extension   of  G,  *  <q,qi>  with  stable  letters 

{r^,  i  e  1} 
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(iv)   G^  is  an  HNN  extension  of  G^  with  stable  letter  t 
(v)    G   is  an  HNN  extension  of  G   with  stable  letter  k . 

The  proof  that  G  has  word  problem  in  &^   will  consist  of 
several  reductions.   These  are  shown  schematically  on  the 
following  page.  The  proofs  will  generally  consist  of  two  parts. 
We  will  show  that  a  particular  problem  is  reducible  to  simpler 
problems,  and  also,  that  if  the  simpler  problems  are 
S  -decidable,  then  so  is  the  original  problem.   Ultimately, 
a  "simple  problem"  will  be  shown  to  be  £  -decidable,  thus 
making  all  the  problems  S  -decidable.   We  recall  that  the 
notation   (?g)T(6)   denotes  the  problem  to  determine  for  an 
arbitrary  word  3  whether  T(6)  is  true;  if  G*  is  an  HNN  exten- 
sion of  G  with  stable  letters   p   and   w  is  any  word  in  G  , 
then  p[w]   is  the  p-reduced  word  w;  llwll  is  the  word  w  with 
aa    and  a   a   deleted.   U  =  V   means  that  U  and  V  have  exactly 
the  same  spelling. 
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w    =    1    in   G 


6.2.1 


6.2.1 


(?k-free   V) (3D) 
V   =    D    in    G 


(?k-free   V) 
V    =    1    in   G. 

6.2.2 

(?P)P=1    in   G2 
6.2.10    /  \6.2.10 


(?F)F=1 
in    G^  *<q ,q, > 


(?X,Y)Xq.Y=q    in    S 


(?F)  (3n)F=x       in    G^*<q,qj^> 
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We   designate  certain  variables  to  range  over  special 
types  of  words: 

Variables  Range 

D  products  of  the  words  x,r.,q   tq  (and  inverses) 

P  k,t-free  words  of  G,  i.e.,  words  of  G„ 

R,R^ ,R_       words  on  x  and  r.  ,  i  s  i 

F  k,t,r.  (i  G  I)  -free  words  of  G,  i.e.,  words 


of  G^  *  <q,q-|^: 


products  of  the  words  F.q.  Gi,  fv^x, 
b  =  1 , . . . ,M  (i  fixed) 


B  products  of  the  words  H.q.  K.,  f,x 

b  =  1 , . . . ,M  (i  fixed) 


M,N         k , t ,q , r . -f ree  words  of  G,  i.e.  words  of  G, 


X,Y         positive  words  on  f,   ,  b  =  1 ,  .  .  .  ,M 


V  and  w  will  stand  for  arbitrary  words  of  G 
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5 . 2   Proofs 

For  the  rest  of  this  section,  we  take  n  ^  3. 

Leirona    6.2.1  (i)    w    =    1    in   G    is    reducible    to 

(1)  (?k-free   V) {3D)V   =   D   in   G^ 

(2)  (?k-free   V)V   =1  ^^   ^2 

(ii)   If  (1)  and  (2)  are  S  -decidable,  then  G  has  word  problem 

on 
in  £  . 


n 


Proof:   (i)  5.1.10 

(ii)  5.2.4  (and  5.2.3)  . 

Lemma  6.2.2    (i)  (?k-free  V)V  =  1  in  G^  is  reducible  to 

(1)  (?P)  (3R)P  =  R  in  G^ 

(2)  (?P)P  =1      i^  ^2 

(ii)   If  (1)  and  (2)  are  £  -decidable,  then  G^  has  word 

problem  in  £  . 

Proof:   (i)  5.1.10. 

(ii)  5.2.4  (and  5.2.3)  . 


Lemma  6.2.3    (i)   (?k-free  V)  (3D)  (V  =  D  in  G^)   (*)  is  reducible 

in  G2 


(ii)   If  (1)  and  (2)  are  s"-decidable ,  then  (*)  is  S^-decidable-. 

Proof:   (i)   Given  an  arbitrary  k-free  word  V,  first  t-reduce  V 
(this  is  possible  using  (1)).   Let  t[V]  be  the  t-reduced  word. 
Then,  (3D)V  =  D  in  G   *»  (3D)t[V]  =  D  in  G^ .   Since  every  word 
in  G-  is  equal  to  some  t-reduced  word,  we  can  take  D  to  be 
t-reduced.   There  are  two  cases  to  consider: 


to 

(1) 

(?P)  (3R)P   =   R 

(2) 

(?P)  (3Rt  ,R^)R 
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(a)   t[V]  is  t-free 
Then  t[V]  =  D 
Thus,  if  t[V]  is  t-free,  then 


Then  t[V]  -   D  and  D  t-reduced  =»  D  =  R  =  word  on  r.  and  x, 


{3D)t[V]  =  D   in   G^   **   (3R)t[V]  =  R   in   G 

(b)   t[V]  is  not  t-free. 

Then  t[V]  =  Ut^P   where  e  =  +  1,  P  is  t-free. 

(**)   If  Ut^P  =  D  (D  t-reduced),  then  by  5.1.11, 

-     -1  e  -1  -1 

D  =  D'q   t  qR,  and  PR^  q    is  a  pinch;  i.e., 

-1-1 
PR,  q    =  R  in  G_  ,  where  R  is  a  word  on  r.  and  x. 

-1-1  ■"" 

Hence,  R   PR   =  q  . 

(***)  On  the  other  hand,  if  (3R,,R  )R,PR„  =  q,  then 

Ut^P  =  UPP~"'"t^P  =  UPR  (R~"'"P~"'"R~'^)R  t^PR  r""*" 

-1  e   -1 
=  UPR2q   t  qR2  . 

And  so,  (3D)Ut^P  =  D  =>  (3d)UP  =  D. 
But  also,  (3d)UP  =  D  =>  (3D)Ut^P  =  D. 
Hence,   (3D)Ut^P  =  D  *>  (3D)UP  =  D. 


Thus,  for  t[V]  not  t-free,  the  procedure  to  decide  (*) 
would  be  as  follows:   Decide  (using  (2))  for  t[V]  =  Ut  P 
if  R, PR„  =  q  in  G„  for  some  R,  and  R„ .   If  not,  then  by 
(**) ,  not    (3D)V  =  D  in  G  .   Otherwise,  apply  the  same 
procedure  to  UP  (by  (***)  ,  the  number  of  t's  in  the  word 
to  which  the  procedure  is  applied  decreases  by  one  with 
each  application  of  the  procedure) .   Do  this  until  there 
are  no  more  t's   and  then  use  (1). 

(ii)    By  (5.2.3),  if  (1)  is  S^-decidable ,  then  V  can  be 
t-reduced  in  <   f(length(V))  steps,  f  e  8.^.      Note  that  since 
the  relations  involving  t  are  txt    =  x,  tr.t    =  r.  ,  we 
can  t-reduce  merely  by  deleting  t's  and  length  (t[V])  _<  length  (V) 
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The  number  of  times  (2)  needs  to  be  decided   depends  on  the 
number  of  t's.   Also,  by  (**),  the  word  for  which  (2)  needs 
to  be  decided  is  a  subword  of  the  original  word  with  t's 
removed.   Therefore,  if  (2)  is  decidable  in  time 
<_   gdength  (V)  )  ,  then  (*)  is  decidable  in  time 
£  f (length(V) )  +  (number  t ' s) (g (length (V) ) ) 
<  f (length (V))  +  (length (V) ) (g (length (V) )) ,  which  is  in  &^ 
if  f  and  g  are.  ^^ 


Lemma  6.2.4    Let  M  be  a  word  on  f,  and  x,  i.e.,  M  G  G.,  . 
b  1 

Let   C  =  {fu^|b  =  1,...,M}.  Then  the  number  of  steps  needed 

-> 
to  obtain  f[M]  (the  f,  -reduced  word  for  all  b)  is  in  £  .  j-. 

Proof:   (5.2.8)  . 


Lemma  6.2.5    (i)   LetAbea  word  on  {F.q.  G.,f,x,.,.  ,f..x} 
1^1,  1   1      '  M 

(i  fixed)  .   Then  A'  =    II  All  is  f-reduced  and  q-reduced . 

(ii)   Let  B  be  a  word  on  {H.q.  K .  , f , x~  , . .  .  , f„x~  } 

1^1-  1   1  M 

(i  fixed).   Then  B'  =    IIbH  is  f-reduced  and  q-reduced. 

Proof:   Inspection  of  the  spelling  of  A'  and  B'  shows  that 

A'  and  B'  can  contain  no  subword  f ,  x  ^f~      or  f,"  x"f,  .     „ 

k     k       k     k      D 

Remark  6.2.6    By  5.1.11,  if  F  is  f-  and  q-reduced  and 

(3n)x  F  =  A  or  (3n)Fx   =  A  (where  A  is  f-  and  q-reduced), 

then  the  sequence  of  q.  's  and  f ,  ' s  in  F  and  A  must  be 

lb 

identical . 

From  6.2.5,  we  can  conclude  the  following: 

Lemma  6.2.7    Let  w   be  the  word  w  with  x's  deleted.   If  F 
is  f-  and  q-reduced   and  F  contains  q?  (e  =  +  1) ,  but  no 
subword  F'  where  (F')   =  (F.q.  G.)^  then 


> 


l-"!,   1 
(i)     (^n)  x"f  =  a   in   G   *  <q,q, 

(ii)   (Jn)    Fx"  =  A   in   G,  *  <q,q  > 

(iii)  (3^A)  F  =  A     in   G,  *  <q,q,>  . 

(An  analogous  statement  holds  with  "A"  replaced  by  "B"  and 

"F.q.  G."   replaced  by   "H.q.  K."). 
1  i-|  1  -'--'-?  ^ 
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Note ;   If  F  is  f-reduced,  then  llFll  is  q-reduced. 

3 

Lemina  6.2.8    Each  of  the  following  is  £  -decidable: 

(i)  (?F)  (3n)  (3A)  Fx"  -  A   in   G   *  <q,q-L> 

'-^  *   <q.q3_: 
'i   *  <q.qj^- 

Proof:   A  proof  will  be  given  for  (i) .   All  the  rest  are 


(ii)  (?F) {3n) (3B)  Fx"  =  B   in   G^  *  <q,qT> 

(iii)  (?F) (3n) (3A)x"f   =  A   in   G^  *  <q,qT> 

(iv)  (?F) (3n) {3B)  x^F  =  B   in   G   *  <q,qn> 


similar.   To  decide  (i) ,  first  f-  and  q-reduce   F.   Let 
[F]  be  the  f-  and  q-reduced  word.   Check  if  [F]  contains  a 
subword  of  the  form  given  in  6.2.7   whenever  [f]  contains 
q.  ,   If  not,  then  not     {3n) (3a)Fx"  =  A,   If  so  (or  if  F 
contains  no  q .  ) ,  then  by  6.2.6   we  know  that  word  A    Fx 
has  to  be  equal  to  if  indeed  (3n) (3A)Fx   =  A,   Thus,  we  can 
generate  a  word   A,  whose  sequence  of  q's  and  f's  is  the  same 

—  1  ■n 

as  that  of  [F]  and  check  if  A,  F  =  x  . 

3 
To  show  £  -decidability,  suppose  that  to  f-  and  q-reduce 

F  requires  <_   g  (length  (F))  steps.   If  (3n)(3A)  Fx^  =  A,  then 

length  (A)  <_   2    length  (F),  for  the  number  of  f's  in  A  is 

<_  the  number  of  f's  in  F,  and  the  number  of  x's  in  A  is 

<_   the  number  of  f's  in  A.   Thus,  to  decide  "A,  [F]  =  x"" 

requires  <_   g  (g  (length  (F)  )  +  2-length(F))  steps,  and  so  to 

decide  (i)  requires  <_   2 -g  (  2 -g  (length  (F)  )  )  steps,  which 

IS  m  £  .  |--| 

Corollary  6.2.9    The  following  are  S  -decidable: 

(i)  (?F)  (3A)F  =  A   in   G^  *  <q,q3_> 

(ii)  (?F) (3B)F  -  B   in   G^  *  <q,q^>. 

Proof :   Same  as  6.2.8,  except  that  instead  of  checking 

A   F  =  X  ,  we  check  whether  A   F  =  1  .  r-. 
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Lemma  6.2.10 


(i; 


(2) 
(ii) 


(i)  (?P)P 

(?F) (3A)F  =  A  in 

(?F) (3B)F  =  B  in 

(?F)F      =  1  in   G 


1  in  Gp  is  reducible  to: 


G^  *  <q,q^> 
G-^    *    <q,q^> 


3_  *  <q/q3_> 


has  word  problem   in  £  . 


Proof:   (i)   5.1.10. 

(ii)     Recall  from  the  previous  lemma  that   if  (3A)F  =  A, 
then  length  (A)  <_   2  •  length  (F).   Also,  if  r~  A '  r  .  =  B',  then 
length(B')  <  c-length(A')  where  c  is  a  constant.   (A  similar 


statement  holds  for  "B"  in  place  of  "A".)   By  5.2.2,  the 

3 
number  of  steps  needed  to  r-reduce  P  is  in  £  .   It  remains 

3 
to  show  that   G,  *  <qfqi>   has  word  problem  in  £  .   To  decide 


'1 


if  F 


1  in   G,  *  <q,qi>  we  just  f-reduce  and  then  see  if 
11  2 

f[F]ll  -    1,    and  this  procedure  is  in  £  . 


Corollary  6.2.11    G,   and  G   have  word  problems  in  £ 
Proof:   G,  and  G^  are  subgroups  of  a  group  having  word 


problem  in  £' 


0 


Lemma  6.2.12 


(i)   (?P)(3R)P  =  R  in  G   is  reducible  to:   (*) 


(1) 


(2; 


(3) 
(ii) 


(?F) 
(?F) 
(?F) 
(?F) 
(?F) 
(?F) 
(?F) 


3A)F  =  A   in   G,  *  <q,q^> 


3B)F  =  B   in 
n 


G^  *  <q,q3^> 


3n)(3A)Fx   =  A   in   G^  *  <q,qj_> 
3n)  (3B)Fx"  =  B   in   G^  *  <q,q-,^> 


3n)  (3A)x  F  =  A   in   G^ 


*  <q,q-^> 


3n)  (33)x  F  =  B   in   G^  *  <q,q-[^> 


3n)F  =  X 


in   G^  *  <q,q^> 


(*)  is  £  -decidable 
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> 


Proof :  (i)  Given  P,  to  decide  if  (3r)p  =  R  in  G  ,  first 
r-reduce  P.  Let  r[P]  be  the  r-reduced  word  P.  There  are 
two  cases: 

(a)  r [P]  is  r . -free  (i  e  i)  . 

Claim:   (3R)  r[P]  =  R  in  G^  **  (3n)r[P]  =  x"  in  G   *  <q,q-. 

Proof  of  claim:  (*=)   Clear. 

(=>)  Suppose  (3R)  r[P]  =  R  in  G^.   Then 
(3r  which  is  r-reduced) r [P]  =  R  in  G-.   By  5.1.11  r[P]  and  R 
must  contain  the  same  number  of  r's.  Therefore  R  contains  no 
r's,  and  so  (3n)R  =  x   in  G„.   But, 

(3n)r[P]  =  x"^  in  G^   <>    (3n)r[P]  -  x"  in  G^  *  <q,q^>. 

(b)  r[P]  is  not  r.-free  (i  e  I)  . 

If   r[P]  contains  only  r's  and  x's,  then  we  are  done. 
Suppose  that  the  last  and  first  letter  of  r[P]  is  not  r^ 
(i  s  I ,  e  =  +  1)  or  X   (e  =  +  1)  (else  we  may  consider  the 

word  without  the  final  r^  or   x^) .   Then  r[P]  =  F,r.  P, 

^2  ^  1 

=  ^2^1  ^2   ^here   e,  =  +  1,   ^2  =  +  1,  and  F  ,F   are  words 

of  G^  *  <q,q3^>. 

(**)  If  e-j^  =  1  and  (3n)  (3A)x"f   =  A  in  G^  *  <q,q  >,  then 

F,r.  P,  =  x~"r.  r~"'"x'^F,r.  P,  =  x~"r .  rT-'-Ar.  P,  =  x~"r .  BP, 
1  13^  1        1^  1^    1  1^  1        i-L  1^   1^  1        1^   1 

and  so   (3R)P  =  R  ^  (3R)BP   =  R  , 

If   e   =  -1  and  (3n)  (3b)x'^F   =  B  in  G,  *  <q,q,>,  then 

F^r.-'-p,  =  x'^r'-'-r.  x'^F^r""'"  =  x"'^r~-'-AP, 

1  1-,  1        -"-1  •'-I    -*-  -"-i        -'-1   -*- 

and  so   (3R)P  =  R  *»  (3R)AP,  =  R  . 

If   e^  =  -1  and  (3n) (3A)F  x"  =  A  in  G,  *  <q,q,>,  then 

P„r.^F„  =  P-r"-'-F_x'^r.  rT^x""  -  P_Br~-'-x"" 

2  I2  2     2  I2  2    I2  I2        2   I2 

and   (3r)P  =  R  *>  (3R)P2B  =  R  . 


69 


If   e-  =  1   and   (3n)  (3B)Fx   =  B  in  G   *  <q,q  >,  then 

P_r.  F„  =  P-,r.  F„x"r.  r.  x  "  =  P„Ar.  x  " 
2  I2  2     2  I2  2    I2  I2        2   I2 

and   (3R)P  =  R  *>  (3R)P2A  =  R  . 

(***)  On  the  other  hand,  if  (3R)r[P]  =  R,  then 

^1  ^2 

(3R)F^r   P   =  R  =  P^r   F2 

-1    ^1  ^2    -1 

i.e.,  R  -^F^r.  P^  =1      =  P„r.  F^R    . 
1  1,  1  2  !„  2 

-1    ^1 
By  Britton's  lemma,   R   F,r.  P,   contains  a  pinch 

-^1  n^      ^^ 
r  .   X  F,  r  .  . 

If   e,  =  1,  then  x"f,  =  A  in  G   *  <q,q  >. 

If   e,  -   -1,  then  x  F  =  B  in  G,  *  <q,q,>. 

^2    -1 
Similarly  for  P_r.  F„R    =  1  . 

2  I2  2 

Thus,  for  r[P]  not  r-free,  the  procedure  to  decide  (3R)P  =  R 

in  G„   would  be  as  follows: 

Decide  (using  (2) ) 

if  (3n)x  F^  =  A   when   e,  =   1   1  e-i 

\    for  P  =  F  r.  P 
(3n)x'^F^  -  B   when   e^^  =  -1   J  "^  ^1  ^ 

or     if  (3n)F_x   =  A  when   e„  =  -1   1  e^ 

>  for   P  -  P^r.  F 
(3n)F2x"  =  B   when   e2  =   1   J  ^  ^2 

If  not,  then  by  (***)  not    (3  R)  P  =  R. 

Else  apply  the  procedure  to  the  resulting  word  AP, ,  BP, 
P„B,  or  P„A  (as  given  by    (**)).   On  each  application 
of  the  procedure  the  number  of  r's  in  the  word  that  it 
is  applied  to  decreases  by  one.  Thus,  it  can  be  applied 
at  most  a  finite  number  of  times. 
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(ii)    By  the   proof  of  6.2.10,   the  number  of  steps  needed 

to  r-reduce  P  is  f  (length  (P)  )  ,  f  6  S"^.   The   length  of  the 

resulting  r-reduced  word  is  <_  f  ( length  (P)  )  .   By  6.2.8,  (2) 

3 
is  £  -decidable.   Suppose  that  to  decide  (2) ,  g(x)  steps 

are  needed   for  a  word  of  length  x  (g  a  nondecreasing  function) 

Then  for  one  application  of  the  procedure  for  (2), 

<_   g  (f  (length  (P)  )  )  steps  are  needed.   The  length  of  the  word 

to  which  the  procedure  for  (2)  is  applied  again  (as  in  (**)) 

is  <_   c  •  f  (length  (P)  )  where  c  is  a  constant  (as  in  the  proof 

of  6.2.10),  and  the  maximum  number  of  times  the  procedure 

for  (2)  is  used  is  <_   length(P).   Therefore  the  number  of  steps 

needed  to  do  (1)  and   (2)  is  <  length  (P)  -g  (c^^'^'^^'^  ^^^ 

•  f (length (P) )) .   The  proof  will  be  completed  by  showing 

that  (?F)  (3n)F  -  x"^  in  G   *  <q,q  >  is  S'^-decidable .   It  is 

easily  verified  that  to  do  this,  just  f-reduce  F,  obtaining 

f[F];  then  (3n)F  =  x'^  if  and  only  if  llf[F]ll  =  x'^,  and  this 

procedure  is  in  £  . 

Theorem  6.2.13    G^  has  word  problem  in  £  . 

Proof:       6.2.10    and    6.2.12.  !_. 


D 


Lemma    6.2.14       (i)     (?P)  ( BR-^ ,  R2)  R-lPR2    =   ^    ^^   '^2    ^^    reducible    (*; 
to  • 


(1) 


(2: 


(3) 


(?F)  ( 

(?F)  ( 

(?F)  ( 

(?F)  ( 

(?F)  ( 

(?F)   ( 

(?F)  ( 

3A) F  =  A  in  G,  *  <q,q, > 
3B)F  =  B  in  G,  *  <q,q-,> 
3n)  (3A)Fx'^  =  A   in   G,  *  <q,q-,: 


3n)  (3B)Fx   =  B   in   G  *  <q,qn> 

3n)  (3A)x  F  =  a   in   G,  *  <q,q-,> 

3n)  (3B)x"f  =  B   in   G  *  <q,qi> 

3R-,  ,R2)  R-,  FR  =  q  in  G^  . 


(ii)    If  (1),  (2),  and  (3)  are  £' -decidable ,  then  (*: 
is  £-decidable. 
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Proof:   (i)   is  almost  identical  to  that  of  6,2.12  (i) . 

(ii)     By  the  proof  of  6.1.12  (ii),  to  reduce  (*)  to  (3) 
requires  <_   h  (length  (P))  steps,  h  e  S  .   We  will  show  that 
(3)  is  £  -decidable. 

Claim:    (3R,,R  )R,FR   =  q  in  G-  ** 


and 


3m)  (3n)  (3X)  (3Y)F  =  x"^Xq.Yx" 


(Xq  .Y)  *  =  q  in  S 


Pooof  of  claim:   (•=)   Let  T.    -   Xq.Y.   Then  Z  satisfies  the 

-'  -1  -1        -1  -1 

hypothesis  of  Boone's  lemma  (6.1.2)  and  sok  Z      tZkZ   t   1=1 

in  G.   Since  G  is  an  HNN  extension  of  G-.  with  stable  letter  k, 

-1 
by  Britton's  lemma,  Z    t  Z  =  V  in  G-^  where  V  is  a  word  on  r. 

-1  ^ 

(i  £  I),  X,  q   tq .   Without  loss  of  generality  we  can  take  V 

-1  e 
to  be  t-reduced.   Then  V  E  V R^q   t  qR, .   Now, 

Z"-^  t  Z  V"-^  =  Z"-*-  t  Z  Ri"'-q""'-t~^qR~"^  V'"^  =  1  in  G^  . 

By  Britton's  lemma,  Z  R~-'-q~   is  a  pinch,  i.e., 

-1-1 
Z  R^  q    =^  R  in  G„  ,  where  R  is  a  word  on   r.  (i  s  i)  and  x. 

Hence  R    Z  R    =  q  in  G„ .   Since  F  =  x   Z  x  ,  we  have  the 

desired  result. 

(=>)   Suppose  (3r  ,R  )R  FR   =  q  in  G^-   (We  can  take  R   and  R2 

to  be  r-reduced.   Then  R'-'-qR"  F~   =  1  in  G„.   By  Britton's 

-1-1  1    ^  -^ 

lemma,  R,  qR_   can  be  successively  r-reduced.   But  on 

r-reducing,  all  the  f's  to  the  left  of  q  have  negative 

exponents,  and  those  to  the  right  have  positive  exponents. 

Since  x's  can  be  moved  easily  to  the  right  of  f,  if  e  =  +1 

and  to  the  left  if  e  =  -1,  F  has  the  desired  form.   To  decide 

3 
if  F  is  of  the  required  form  is  in  £  :   f-reduce  F  and  check 

if  the  resulting  word  is  of  the  desired  form  by  a  procedure 

similar  to  6.2.6.   If  not,  then  not    (3  R-,  ,  R^)  R-,  FR2  =  q  in  G2 . 

If  so,  then  since  Xq.Y  =  q  in  S  in  £  -decidable,  (3)  is 

n 
£  -decidable. 
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6.3   £  -Decidable  Groups 

Theorem  6.3.1    For  n  ^  4,  there  exist   finitely   presented 
groups  with  word  problem  strictly  in  £  .  p. 

Proof:   Lemmas  of  Section  6.2. 

If  the  relation  kq   tq  =  q   tqk  is  omitted  in  the 

construction  of  the  groups  in  the  previous  sections,  then  the 

3 
resulting  group  has  word  problem  in  £  .   Thus,  G  has  a  hard 

"subgroup  problem". 

Theorem  6.3.2    For  n  >  2,  there  exist  finitely  presented 

D 


groups  with  word  problem  in  £ 


Proof:   6.3,1,  5.2.6,  5.2.7. 

The  "simplest  groups",  the  free  groups,  have  word  problem 

iel 
2 


2 

in  £  .   It  is  now  known  whether  the  word  problem  for  free 


groups  can  be  lower  than  £' 
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CHAPTER  7 
OPEN  QUESTIONS 

(1)  Can  some  of  the  embedding  theorems  for  groups  be  shown 
to  be  special  cases  of  embedding  theorems  for  logical 
theories?   Sabbagh  [1974]  notes  that  the  group  theoretic 
analogue  of  the  theorem  in  logic  that  a  decidable  theory 
has  a  complete  decidable  extension  is  that  a  group  having 
solvable  word  problem  can  be  embedded  in  a  simple  group 
having  solvable  word  problem.   But  there  is  no  connection 
between  the  proofs.   In  the  theorem  from  logic,  the  result- 
ing extension  has  the  same  language  as  the  original  theory. 
In  the  group  embedding  theorem,  the  simple  group  has 
additional  generators.   So  to  show  any  connection,  it  seems 
as  if  it  is  necessary  to  consider  theories  extended  by  adding 
new  symbols.   It  would  be  especially  interesting  to  see  if 
the  Higman  embedding  theorem  is  a  special  case   of  a  theorem 
from  logic. 

(2)  Can  the  word  problem  for  groups  be  proved  unsolvable  by 
constructing  a  group  in  which  all  the  generators  are  functions, 
similar  to  the  way  in  which  the  word  problem  for  semigroups 
was  proved  unsolvable  by  J.  Robinson  [1968]?   This  method 
would  have  the  advantage  of  presenting  a  group  as  a  concrete 
system . 

(3)  The  ^a  -decidable  groups  constructed  in  this  paper 
were  a  series  of  four  HNN  extensions.   Is  it  possible  to 
construct  £  -decidable  groups  which  are  structurally  simpler? 
C.  Miller  [1971]  has  shown  that  the  lowest  HNN  extension 
having  unsolvable  word  problem  is  of  order  two.  It  may  be 
possible  to  use  his  technique  in  constructing  simpler  £"- 
decidable  groups . 
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